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Harold Hotelling 


Harold Hotelling was bom in Fulda, Minn., on September 29, 
1895. While he was still a small boy his family moved to Seattle, 
Wash., where he spent most of his youth. 

After finishing high school in Seattle he worked on small 
weekly newspapers in Washington from 1915 to 1916. He was 
graduated from the University of Washington in 1919 and took 
an M. A. degree from the same institution in 1921. He earned 
his doctorate in mathematics at Princeton University in 1924. 
As a graduate student he taught mathematics at both Washing- 
ton and Princeton. 

His first permanent academic position was that of mathematical 
consultant at the Stanford Food Research Institute, a position 
he held from 1924 until 1927. He became associate professor 
of mathematics at Stanford University in 1927 and continued 
in this position until 1931. In 1929 he spent several months at 
the Rothamsted Experiment Station, working with Sir Ronald 
Fisher. 

In 1931 he left Stanford University to become professor of 
economics at Columbia University. During his years at Columbia 
he served as head of the Statistical Research Group of the 
Division of War Research, and was instrumental in the founding 
of the Department of Mathematical Statistics at Columbia. 

In 1946 he accepted his present position as Professor of Mathe- 
matical Statistics and Associate Director of the Institute of 
Statistics at the University of North Carolina. He was sub- 
sequently given the additional post of Professor of Economics, 
and in this capacity he teaches courses in mathematical economics. 

Many honors and recognitions have been awarded him. He 
is past president of the Econometric Society, the Indian Statistical 
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Congress, and the Institute of Mathematical Statistics. He 
served as vice president of the American Statistical Association 
and Section K of the American Association for the Advancement 
of Science. He was awarded an honorary Doctor of Laws degree 
by the University of Chicago in 1955. 

A bare recital of biographical facts does little more than hint 
at the important contributions that Professor Hotelling has 
made to the mathematical and social sciences. Few men can 
match his mastery of mathematics, statistics, and economics, 
and fewer yet can match his record of major contributions in all 
three fields. In the field of economics, his contributions to the 
theory of utility and demand, welfare economics, and the 
incidence of taxation have already assumed the stature of 
classics. In addition, his contributions to statistics have found 
applications in econometrics as well as in other areas of the 
physical, biological, and social sciences. 

Professor Hotelling's pioneering work in the teaching of 
mathematical statistics and mathematical economics was 
fundamental to achieving recognition for these important 
disciplines in American universities. Here one may cite the 
important part he played in developing these areas at Columbia 
University and the University of North Carolina. The generations 
of graduate students who have worked with him also indicate 
the major role he has played in developing mathematical 
statistics and mathematical economics in the United States. 

The essays in this volume, dealing with topics in mathematical 
economics and econometrics, are offered by the authors as 
indications of their respect for the contributions that Harold 
Hotelling has made to economics. The authors, some of them 
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former students and all of them long-time associates of Professor 
Hotelling, also offer these essays as tokens of their warm personal 
regard for him. 
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PART I 

On the Mathematics of Optimization 




PAUL A. SAMUELSON 


Structure of a Minimum Equilibrium System 


1. The functions met with in classical thermodynamics possess 
an abstract structure shared by equilibrium systems in quite 
diverse other fields — e.g., statics, economic theory, the recently 
developed mathematical theory of games and linear programming, 
etc. 1 ) Given a set of numerical data on punch cards purporting 
to come from an equilibrium system sharing these properties, 
we should be able to determine whether such a supposition is 
tenable. Once the full empirical implications of the hypotheses 
embodied in the equilibrium theories have been elucidated, we 
can, if we wish, dispense with that theory except for mnemonic 
purposes. 

By use of matrix notation, the present paper gives a brief 
but exhaustive summary of the properties of such systems, 
confining itself for simplicity to the regular case of unique and 
differentiable functions. Then it develops an exhaustive set of 
conditions that any finite set of observations must meet if the 
theory in question is not to be refuted. 

2. Imagine that we are given a set of 2n variables that come 
in conjugate pairs {x^y^. (In phenomenological thermodynamics 
these might be designated as generalized coordinates and 
generalized forces, such as entropy and temperature, volume and 
negative pressure, chemical mass and chemical potential, etc.) 2 ) 

J ) For the thermodynamic and economic interpretation of the mathematical 
relations here analyzed see such references as E. A. Guggenheim, Thermodynamics 
(Amsterdam, 1950); L. Tisza, “On the General Theory of Phase Transitions,” 
in Phase Transformations in Solids (Wiley, 1951, N.Y.), pp. 1—37; P. A. 
Samuelson, Foundations of Economic Analysis (Cambridge, Mass. 1947), Ch. 3. 

2 ) To avoid indeterminacy of scale, all extensive quantities can be thought 
of as expressed m ratio to the amount of an extensive variable, x n +v which 
is not included in this analysis. 
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There is assum ed to exist n equilibrium conditions or equations 
of state between these 2 n variables, defined as follows: 

Axiom for a Regular Equilibrium System: Corresponding 
to prescribed values for n variables (y v . . y n ), values of the 
conjugate variables (x v . . x n ) are determined so as to 
provide a regular minimum to F(x; y) = F(x v . . x n ; 

y» • • •> y n ) = YK ■ ■ ■> *») - Ii so that 

(1) dF/dxt — 0 = Yi(x v . . x n ) — y 2 (i = 1, . . n) 

where subscripts to functions always stand for partial deriva- 
tives with respect to the indicated variables, and where the 
Hessian matrix of second derivatives of F and Y with respect 
to the x*s is everywhere defined and is positive definite so as 
to insure a regular minimum. 

Knowledge of the function Y above determines all properties 
of the equilibrium system. (In thermodynamics, Y is internal 
energy.) These regularity conditions of smoothness and con- 
vexity could be relaxed. But for the present it is sufficient to 
deduce the full implications of the regular case. We have im- 
mediately from the Axiom: 

Theorem 1: Functions constraining a set of variables are 
equivalent to a regular equilibrium system, if and only if, 
they can be thrown into the form: 

(2) a:,- = X i (y 1 , . . y n ) ss X t (y v . . y n ) ( i = 1, . . n), 

where X(y v . . ., y n ) is an existent twice differentiable func- 
tion, whose Hessian matrix of partial derivatives — which is 
also the symmetric Jacobian of the transformation (2) — must 
be positive definite. 

The necessity part of this theorem follows immediately from 
the fact that the relations (1) can always be inverted by virtue 
of the fact that their Jacobian is postulated to be the non- 
singular matrix of a positive definite quadratic form. Equations 
(2) represent such inverse functions, and their Jacobian 
[Xj] = [y«]- x - Since the inverse of a symmetric matrix must be 
symmetric, our right to set X i = X t is established and the 
integrability or symmetry conditions guarantee the existence 
of the function X(y v . , ., y n ) = £ J X i dx t independently of 
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path, which is unique up to an inessential integration constant. 
Finally, because the inverse of a positive definite matrix must 
be positive definite, this stipulated property is indeed necessary. 

The sufficiency part of the theorem can be most easily seen 
by recognizing the dualism between the xs and y* s. If we 
interchanged the letters in (2), we should have a relation with 
all the properties of (1); call it (1)* and call (2)* the relation 
(1) with the labels interchanged. Then by the just proved 
necessary condition, it follows that (1)* implies (2)*, which is 
all that we need for our sufficiency proof of Theorem 1. This 
also shows the truth of 

Corollary 1: The variables (x v . . ., x n ) and (y v . . ., y n ) 
are mutually conjugate: the same equilibrium system can be 
defined by the dual minimum problem. 

F*(y,x) = X{y 1 , . . y n ) — Ya x ,Vs> a minimum for pre- 
scribed xs. 

3. The equilibrium conditions of (1), or (2), were seen to be 
n constraints among 2n variables. There are a vast number of 
ways of taking as prescribable variables n out of 2n variables, 
namely (2n)l/(nl) 2 permutations. But in the general case, there 
is no reason to believe that any n variables taken at random 
can be independently varied. Thus, in the simple case where 
Y(x v x 2> . . .) = \^x], the equations of state are x { = y lt and 
the Hessians of Y and of X are the identity matrixes. Clearly 
for this system, the variables (x v y v x 3) . . .) cannot be indepen- 
dently varied. 

In general, the only admissible sets that we can be sure of are 
those generated by the following rule: Select one and only one 
member from each conjugate fair of variables. Since there are 
two independent ways of selecting from each pair, there are 
obviously exactly 2 n such admissible sets of independently 
prescribable variables. If we follow the convention of suitably 
renumbering the variables, we can write any of the 2 n sets in 
one of the following n + 1 forms: 

iVl* * • Vn)> {i/v * • •> Vn—1> ^ n )> • * •> il/lf ■ • •> Vm> '*'m+ 1> * • ^ n)» 

. . (y,; x it . . x n ), (x v . . x„). 

The first and last of these are the “unmixed” sets of (1) and (2).. 
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That the remaining "mixed” sets are admissible as independently 
prescribable follows from the fact that every principal minor 
of the definite Jacobians of (1) and (2) are necessarily definite 
and non-singu lar . This assures us that the first m equations of 
(1) can be solved explicitly for (x v . . x m ) in terms of 
(Vi, ■ ■ •> y m > x m +v •••.*•) since the crucial matrix of partial 
derivatives [dyjdxf] is the non-singular m rowed principal 
min or of [Y 4J ]; with these x’s solved for, we can substitute in 
the remaining n — m equations of (1) to get {y m+1 , . . y n ) in 
terms of the same independent variables. (A dual similar ar- 
gument could be made concerning the solvability for ( y m+v ...,y n ) 
of the last n — m equations of (2).) Thus, by use of the Implicit 
Function Theorem — and by an extension of it that gives no- 
where vanis hing principal minors as sufficient conditions for 
uniqueness of our solutions in the large — we can deduce. 

Theorem 2: For any of the 2 n admissible sets of n independent 
variables, consisting of one and only one member of each pair 
of conjugate variables, we can in regular systems — and only 
in such systems — determine unique and differentiable values of 
the remaining variables. After we have suitably renumbered 
the variables, we can write these relations as 

/3) X * = • • ” &»' * m+1 ’ ‘ ^ = 

' ] Vi = V • • •> Vm. *m+ 1 . • • - *») (* = » + 1 . • • M n), 

0 <*m 


J m 
(4) 


r y- 1 —Y~ l Y - 

XX > ■* XX xx 

V Y” 1 V V V— 1 V 

L " X - 1 XX * X XX * XX ^ XX * X 


with the partitioned Jacobian matrix of skew-symmetric form 
dx dx~] 
dy dx 
dy dy I 
dy dxA 

P "^^527 X^ y y^^yy ‘Xyfl) X. QyX. yy 

y-l y 

I— yy X\. yq 

L -B' C J ’ 


x: 


i r y- 1 

XX 

~Y£Y« 

Y —1 Y 

■* *— yy ^ yQ 

-J 


0 ^ m ^ 


n, 


Y xx being understood to stand for the first wbym rows and 
columns of [Y w ], . . X vg for the last n — m rows and columns 
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of [X {j ], etc.; and the symmetric diagonal blocks A and C 
being positive definite. 3 ) 

In (3) the position of the semi-colon will serve to identify 
the value of m, except in the limiting cases where m = 0 or n, 
in which case we have the unmixed variables and equations of 
(1) and (2). To prove the various relations of (4), we make the 
appropriate matrix substitutions (or matrix premultiplica- 
tions) in the differentials relation of (1) and (2) 



to get the first row's relations of (4), which are seen to be dual 
forms. The next row of (4) immediately follows. The last form 
is important: it summarizes the facts that the diagonal blocks 
are symmetrical, but that the off-diagonal matrixes are skew- 
symmetric with (dx/dx) = — (dy/dy)'. These properties 
follow from the symmetry relations [Y w ] = [Y H ], while the 
definiteness of the positive blocks follows from the definiteness 
of [Y {j ] and [X {j ] and of the inverses of their principal minors. 

To prove the converse part of the theorem — that the structure 
of (3) and (4) implies the structure of (1) or (2) — note that 
by the same routine substitutions by which (5) went into (4), 
but working in reverse, we find for the Jacobian of (y, y) in 
terms of (x, x) 

3 ) If determinacy of scale has not been achieved by the device of the first 
footnote, then Y (x v . . x n ) will be a homogeneous function of order one, and 
its n 2 Hessian [Y„-] will be positive definite of rank n — 1. The functions 
(2) will then be determinate only in ratio and not in scale. For m < n, the 
functions (3) will be perfectly determinate, and can be shown to be homo- 
geneous of order one in [: x m+v . . x n ] alone, requiring C to be positive semi- 
definite of rank (n — m — 1) with [x m+1 , C = 0 and \x x , . . x n ] = 

[ x m+l> • • •» 
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( 6 ) 


J n 


~dy Sy~ 

dx’dx _ r A ' 1 

dy dy *- — B' A~ l 

_dx ’ dx_ 


-A-t-B ' 
C+B'A-'B_ 


This is seen to be symmetric. It remains to show that in- 
dependently of B, the positive definiteness of C and A will 
guarantee that this whole matrix is positive definite, and hence 
of the stipulated regular form 

We make use of the easily demonstrable algebraic fact that 
any positive definite A~ l can be written in an infinity of dif- 
ferent ways as D'D, where D is non-singular. 

Now consider the quadratic form 

Q = — [^l* • • •> Z m> z m+ 1> • • •> Z n]Jn[ Z l’ 4 4 •> Z m> Z m+1> 4 4 •> Z n\ 

— [■ z m+l> 4 4 •’ Z n]C[z m +i, ■ • •> z n] 

[ D'D, -D'[DB)-\\ Z }' 

+ [2i. • • - Z J [_ ( DB y D> (. db)' (DB ) J z : 

= [0, . . ., 0, z m+ i, . . C[0, . . ., 0, z m+1 , . . WW 

where W = [z v . . z m ]D' — [z m+1 , . . z n ](DB)'. 

is positive semi-definite because C was positive definite 
by hypothesis; Q 2 is a sum of squares and hence positive semi- 
definite. But for z'z ^ 0, where Q^ — 0, Q z > 0 and where 
Q 2 = 0 ,Q X > 0; hence, Q is finally proved to be positive definite. 

4. Returning now to the mixed relations (3), their skew- 
symmetry property warns us that, as they stand, these functions 
are not partial derivatives of any existent parent function. 
However, because of the special skew-symmetry between the 
off-diagonal blocks, it is clear that changing the algebraic sign 
of the two upper blocks will give us a symmetric matrix 

<’> [Zi 1 ? } 

which can be regarded as the second-derivative Hessian matrix 
of an existent function for each m. Hence 

Theorem 3: Any regular equilibrium system in 2 n conjugate 
variables has associated with it, for each of the 2" admissible 
sets of independent variables, a characterizing function, which 
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may be written 

P(!t» * * ’> Vm* *^?n4T> * * *» *^n) 

( 8 ) 



where the line integral is independent of path by virtue of 
the symmetry relations. In terms of the relevant characterizing 
function, we can rewrite equations (3) 

fS w *< — ~ dp (yi> ■ ■ •> x n)ltyi> i = 

y. = -f-aPfeq, . . y m ; . . x n )/dx 0 i = m + 1, . . ., n. 

Corollary 1: For m — 0, P= Y; and for m = n, P = —X; 
for all m and for values of the 2 n variables satisfying the n 
equilibrium conditions, 

m n 

(9) P(y v . . ., y m ; ar m+1 , ...,x n ) = Y — J i y j x j = -X+ ^y,z t ; 

1 m+1 

but these are not identities in the 2 n variable space. 

Corollary 2: For prescribed values of (x v . . ., x m ;y m+1 , . . .,y n ) 
the expression 

m n 

(10) F m {x, x; y, y) = P(&, . . ., y m \ x m+1 .. ., 

X w+1 


reaches a regular saddlepoint or minimax when, and only 
when, the remaining variables (y v . . ., y m ; x m+1 , . . ., x„) are in 
their regular equilibrium configuration; i.e., for an equilibrium 
position (xl . . .; . . x°; y $, . . ... y° n ) 

(11) F m (xP, x°; y, y 0 ) ^ F m (x°, «°; y°, y°) ^ F m (x°, x; y°, y°) 


with the inequalities valid for distinct points. 

In classical thermodynamics these 2 n characterizing func- 
tions have a variety of notations! descriptions and go by such 
names as internal energy, enthropy, enthalpy, Helmholtz free 
energy, Gibbs potential or free energy, etc. Knowledge of any 
one of them provides a succinct summary of all the regular 
equilibrium relations, and by use of the transformations of (6) 
and (4) we can in principle go from the properties of any one to 
the properties of any other. 
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The symmetry properties of (4) provides immediate proof of 
the existence of each of these characterizing functions described 
in theorem 3. To prove Corollary 1, we may use the properties 
of Legendre transformations or of differential geometry; but 
remaining within the limits of the present straightforward 
algebraic manipulation of Jacobian matrices, we make the 
equilibrium substitutions to get 

m rn 

Y — — Y •••> • ■ * * •> ^n) 

I 1 

and 

m 

d{Y - 2Vi X i) m 

— i = 2 C Y}-yM- x i = °~ x i> (* = • • -.*») 

dtii i 

(i2) ; 

HY-Zy^) m 

r— = 2(Y i—y^i+Y i=°+2/<» (* — m+1, 

W; l 


which is seen to be identical with P if the inessential integration 
constant is appropriately adjusted. By using the dualism 
between Y and X, the relations of P to — (X — 2m+i x iVi) 
shown in (9) are verified. 

To prove the saddlepoint property of Corollary 2, we note the 
equivalence of 


(13) 


dF m 

dtii 


dF n 

0 {i = 1, . . m); -r=-= 0 (» = m + 1, . . n) 


to (3)'; and we note that in changing the algebraic signs in the 
upper blocks of (4), we achieve (7)'s symmetry but at the cost 
of making its upper diagonal block — A = — YjJ be negative 
rather than positive definite. Together with the positive defini- 
teness of C = X“y, this is sufficient for the saddle-point or 
minimax property. 

This minimax property shows us that the notion of an un- 
mixed set of conjugate variables (y v . . ., y n ) and (x v . . x n ) 
is not arbitrary. We are generally forced to group y x and y % 
together and are not free to interchange the x and y designations 
of y x and x v (Thus, entropy and volume belong together in a 
sense that temperature and volume do not; this we know from 
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the mathematics as well as from the physics.) For if we link 
up admissible sets of conjugate variables without distinguishing 
which are x’s and which are y’s, the Jacobian (4) of the resulting 
transformation (3) will have a determinable number of rows and 
columns that yield skew-symmetry. Thus, mis a determinable 
number which can be verified to be 0 or n or to be in between. 
This enables us to find the proper way of dividing the 2 n con- 
jugate variables into two unmixed groups. Of course, which we 
call x and y has no mathematical significance because of the 
complete dualism between them. (In the singular case where 
whole rows and corresponding columns of J m vanish, we shall 
later show that the exact size of m is indeterminable but in- 
different, since the system then splits off into independent 
constellations.) 

5. Theorem 3 and its corollaries each contains within itself, 
and is equivalent to, the previous theorems and Axiom. If any 
2n variables can be labelled so that n of them can be expressed 
in terms of the remaining n by a transformation with the regular 
minimax properties of (11), such a system must have all the 
properties of a regular equilibrium system. 

For each of the 2 n admissible sets of independent variables, 
there will be n(n — l)/2 symmetry or skew-symmetry relations. 
In thermodynamics these are called Maxwell relations, and any 
one complete set of them implies all the rest. Matrix notation 
and operations themselves constitute a most excellent mnemonic 
device; so the transformation of (4) and (6) represent an excellent 
substitute for the Bridgman 4 ) and similar formulas by which 
partial derivatives of one set can be converted into partial 
derivatives of another. Given numerical coefficients, we can 
efficiently go from, say, J 6 to / 9 , by the series of steps / 6 , J 7> 
J 8 , / 9 , where at each stage the only needed operations are of the 
Gauss-Doolittle type convenient for desk and other calculators, 
namely a — be , and (a — bc)/d . In practice, there is never 
need actually to renumber variables since the four partitioned 
matrices can always be represented by different colors for the 
numbers or by appropriate underlining. 

4 ) P. W. Bridgman, A Condensed Collection of Thermodynamic Formulas 
(Harvard, 1925). 
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We might call a system a “regular variational system” if it 
satisfies all the symmetry equalities of a regular minimum system, 
whether or not it also has the definiteness properties on its 
Jacobians of a minimum. Before exploring further the inequalities 
implied by the minimum property itself, we first give a single 
mini mum formulation that is capable of handling all admissible 
mixed cases simultaneously. 

6. Our original axiom assures us that F (x) y) attains a 
mi nim u m if we set the x’s at their optimal equilibrium values 
in terms of the y’s as determined by the equilibrium relations 
(1) or (2). Thus, the minimum obtainable of F will depend only 
on the y’s and we may write it as F(y) sS F(x; y), with the 
equality sign holding in equilibrium, and only then. It will be 
easily shown that F (y) = —X (y) for an appropriate choice 
of integration constant in the latter dual function to Y(x). 
This suggests 

Theorem 4: For any regular equilibrium system, we can 
find two mutually conjugate functions Y (x v . . ., x n ) and 
X(y v . . ., y n ), such that 

(14) N (x v . . ., x„; y x , ...,y n ) = Y (x v . . ., x„) n 

+ X{y 1 ,...,y n )-J,x i y i 

i 

reaches a regular minimum value of zero if, and only if, the 
2 n variables are in their n equilibrium relations to each other. 
The Hessian matrices of Y and X will be positive definite 
everywhere and in the equilibrium configurations will be 
exact inverses of each other. 

Note that this minimum formulation includes as special cases 
the dual minimum problems of the Axiom and Corollary 1. 
N ote that it is the only formulation yet stated that treats both sides 
of the dual with perfect symmetry, and the only minimum for- 
mulation that handles all admissible mixed sets of variables. 
If the Y, X, and P functions are called “potentials,” then 
N{x] y), which is defined in the 2 n space and not alone on the 
^-dimensional equilibrium locus, is in the nature of “superfluous 
or disequilibrium potential.” 

To prove theorem (4), we note that by F{y )‘ s definition, 
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F{x ; y) — F{y) does have the stipulated properties of N(x\ y). 
It remains to show that X(y), as previously defined from (2), 
with its Hessian [X i5 ] = is for an appropriate choice 

of integration constant identical to — F(y ). But note that for 
m = 0, P(y v . . y n ), as defined already in (8), is identically 
F(y v . . y n ); and as shown in (9), 


3 F 
fyi 


dP(yi> ■ ■ Vn) 
dVi 


= — (* = 1, . - n) 


which agrees with ( 2)'s definition of X. Hence, X(y v . . y n ) = 
— * • - Vn)- 

Corollary 1: The regular equilibrium is defined in terms of 
any admissible set of mixed variables by 


(15) 


^ — 0 — Y { (x i, . . x n ) y { (i — 1, . . n) 
dN 

— = 0 = Xtfyj, ...,»»)-*< (1 = ^+ 1 ,..., ») 


where the Hessian of N with respect to (x v . . x m \ 
y m +i> • . ->y n ) is positive definite being of the form 



If we were to prescribe an inadmissible set of variables, 
involving both an x 4 and y if then we would get —1 terms in the 
off-diagonal blocks of the Hessian matrix (16), and we could 
not infer that it has the positive definiteness properties needed 
for a minimum. Thus, in the case earlier mentioned where 
= Vi, N{x] y) = IJx) - Jx j y j = - y s ) 2 + and 

prescribing (x v y v y z , . . ., y n ) and minimizing with respect to 
the remaining variables would give N = \{x x — y x ) 2 , which 
need not vanish. This confirms that only 2 n sets of variables 
are generally independently variable. 

It will be noted that (15) shows the first m equations of (1) 
and tRe last (n — m) equations of (2) together to be equivalent 
to (1), or to (2), or to (3). Transforming the differentials of (15) 
by the usual methods will give us J m in the next to the last form 
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of (4), a form that treats both aspects of the dual with perfect 
symmetry. It will be noted that the P functions defined in (9) 
did not treat the dual functions Y andZ perfectly symmetrically, 
there being a difference in algebraic sign. But we can split N 
up into two parts, so that 

m n 

N(x;y) = (Y )-(-*+ 

1 Wl+1 

m n 

(i7) = -(Y- 2y,*,)-(x- 2Vi x i) 

1 m + 1 

(in equil.) = P(y v . . y m \ . . ., x n )-P(y v . . „y m ; . . x n ) 

=P(y v ■ • - y m \ • • •» x n) + P*(*v ..;X n ;.. y n ) 
where the definition of the functions P*, mutually complementary 
to P, is obvious. Through the whole 2n space, P and P* are 
independent of each other, even though they are complementary 
on the ^-dimensional equilibrium locus. 

The minimum formulation in terms of N immediately confirms 
that a set like (y x , . . ., y n \ x m+1 , . . x n ) is truly a mixed one, 
since generally N — 2? x jVo can be resolved into two additive 
functions of n variables each in essentially one way only, as can 
be determined from the pattern of zeros in the 2n by 2n Hessian 
matrix of N. The singular case mentioned earlier, where Y (x) 
and X(y) can be each written as the sum of two functions 
involving no overlapping variables, is instantly revealed by the 
use of N(x; y). Thus, if Y(x v . . ., x n ) = R(x v . . x r ) + 
S{x r + V . . x n ), X will be capable of a similar split and we are 
really dealing with two entirely independent sub-systems. 
That being the case, there is no mathematical way of relating 
x x to x n in any sense different from the relating of x 1 to y n . 

Finally, it may be mentioned that the general minimum 
formulation in terms of N remains valid in irregular cases where 
some of the functions have comers with undefined partial 
derivatives, and where the equilibrium relations may not be 
unique or continuous. (Even the simplest case of phase eq uili brium 
of a solid and liquid provides an “irregular” example where 
functions have a corner and partial derivatives are not defined.) 

7. This exhausts the full empirical implications upon our 
observable functions. If someone specifies in detail any given 
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set of functions, we can in principle compute various sets of 
partial derivatives, test various of the symmetry relations, and 
various of the determinantal conditions for definiteness. However, 
suppose that the functions specified are purely empirical and 
not exactly represented by any finite combination of known 
mathematical formulas. We have a procedure for calculating 
the values of the equilibrium functions at any point. Also, we 
can approximate to various of their derivatives; however, unless 
we are given certain a priori bounds on the higher derivatives 
of the function, no matter how far we push our computations, 
we cannot strictly infer the goodness of our approximations 
and therefore cannot, strictly speaking, ever test a symmetry 
relation with complete rigor. 

8. Fortunately, in applied sciences, we usually have at least 
vague notions concerning smoothness of higher derivatives, and 
we can therefore compute an expression like 


Y 2 (x 1 +h v x 2> . . .) — Y 2 (x lt x 2 , . . .) 


Y 1 {x lt x 2 +h 2 , . . .)—Y 1 (x 1 , x 2 , . . .) 


for “small” hq and h 2 , and decide whether it is sufficiently far 
from zero to refute the hypothesis that dY 2 /dx 1 = dY 1 /dx 2 . 

9. J. R. Hicks 5 ) has derived an interesting generalization of 
the above finite tests for integrability or symmetry. For this 
purpose, let us suppose that the (Y 1 , . . ., Y n ) functions of the 
x’s are indeed the partial derivatives of a function Y (x v . . x n ), 
which has continuous second partial derivatives. Then we can 
write various Taylor's expansions such as 


Y{x\, 4 


. .) - Y(xl 4 . . .) = 2 (*} - • • •) + o 

= 2 K-4Y,(44 ■••) + *; 

1 

= 2 {x) - x*)Yj{x i, x\, ...)- R'i 

= - x^) I^iAii±tlSLAiiA +R 3 


6 ) J. R. Hicks, A Revision of Demand Theory (Oxford, 1956), p. 126. 
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where ( x v x 2 , . . .) or (x) represents a point intermediate between 
(a: 1 ) and (x 2 ); where R 2 and R 2 represent remainder terms 
involving second-degree or higher terms in ( x ) — x 2 ) ; and where 
R 3 involves third-degree or higher terms in (x) — x 2 ) because 
i?2 and —R'z can be shown to differ by terms of higher than the 
second degree. 

Following Hicks, for the rest of this section I omit all terms 
of higher than second degree and state relations that the resulting 
approximations must satisfy. Of course 

0 = [Y (a; 1 ) - Y{x 2 )] + [Y( x 2 ) - Y(x 3 )] + [Y(a; 3 ) - Y(a: 1 )] 

(20) = 

where the equilibrium substitutions ^ = Y i (x v . . x n ) have 
been made. 

This is a generalized integrability condition that any three 
near-by points must approximate to. This general Hicks con- 
dition includes as a special case the above relation (18); to see 
this set (a; 1 ) = (x v x 2 , . . .), {x 2 ) — (a^-f^, x 2 , . . .), (a 3 ) = 
(x v x j + A 2 , . . .), and rearrange the resulting terms of (20) 
to get (18). 

An obvious slight generalization of the Hicks reasoning will 
give us conditions that any 4, 5, 6, . . ., or r distinct points must 
satisfy as they come close together. 

Theorem 5: Any r nearby points observed from a regular, 

continuously smooth, minimum system must satisfy the finite 

equality 

0 = 2 (Y(a;*) - Y(* s+1 )) + (Y (x r ) - Y (x 1 )) 

k—l 

(21) = 2 (ii^f-^ +1 m+yr 1 ))+ 

j= 1 3=1 

= i( &y-2^y]) + & j f i -2x s f)+ ■ ■ ■ 

1 1 

+&x r }y)-ltiy r j))+R a 
1 * 

where R & involves terms of at least the third degree in (xj—xf). 

By setting (a! 1 ), (a: 2 ), (a; 3 ), (a: 4 ) equal to (x v x 2 , . . .), 
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(x x + h v x 2 , . . .), { x v x 2 + K • • •)> ( X 1 + h i> x 2 > • * •)> we can 
convert the above into the equality that must hold in the limit 
between the various more familiar finite approximations to 


d 3 Y d i 

' &y \ 

i_ ±i 

(9 2 Y\ 

3x\ 3x 2 Bx 1 

\Bx ^ BxJ 

1 dxj 



and still other special cases can be derived involving chains 
of any number of observations in excess of 2. 

10. Equations (21) would be adequate tests for symmetry 
and for the variational (as distinct from minimum) aspect of 
the problem were it not for the fundamental methodological 
difficulty already mentioned in paragraph 7. Even if we could 
exactly calculate — free of all statistical or experimental error — 
any point on the ^-dimensional equilibrium locus in the 2 n- 
dimensional space, nonetheless we could never be perfectly 
certain that an observed value for the bracketed expression 
in (18) did or did not refute integrability. If the bracketed test 
expression exactly equals zero, we cannot be sure that for still 
closer together points it will not turn out to differ from zero. 
If the test differs from zero, we cannot be sure that the difference 
is greater than the admissible remainder term R z . Pragmatically, 
we may sidestep this methodological difficulty by hypothesizing 
a 'priori knowledge; or we may not care whether the equilibrium 
system is “really” variational, provided it behaves sufficiently 
like one for the purpose at hand. 

The advantage of being able to convert a problem into variatio- 
nal form seems to be mainly mnemonic. Instead of having to 
record or remember n independent functions Y i (x 1 , . . ., x n ), 
we need only to know one function Y (x lf . . x n ). Or it is enough 
to know everywhere the one function BY (x v . . ., x n )jBx x> with 
BY {x lf x 2 , . . x n )jdx 2 being only known everywhere in the sub- 
space where x — x x \ likewise BY (x v x 2 , x 3 , . . x n )/3x 3 need be 
known only for varying (x 3> . . ., x n ) in the subspace where 
(x v x 2 ) ~ (x v x 2 ); . . . finally 3Y(x v . . x n ~ v x r )\3x n need be 
known only for varying x n with the remaining variables fixed. 
(E.g. in a simple entropy, volume, absolute temperature, pres- 
sure system, knowledge everywhere of the pressure-volume- 
temperature “equations of state” need be supplemented by 
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knowledge of the specific heat function at one temperature 
alone to define internal energy and all thermodynamic behavior 
everywhere. ) 

Also, a general one-to-one transformation of coordinates, 
as defined by x t = f l (v v . . ., v n ), will lead to new variables 
w-y, . . ., w n , conjugate to the v’s and determined by the identity 

dY = St w, dv, = r Vi **i = TdTi Y i - >nt ) (V V dv,. 

11. It turns out, however, that we can in principle give an 
exact test to refute the hypothesis that we have a minimum 
system. Given a set of finite observations, presumed free of 
statistical error or with known bounds on that error, we can 
specify exact inequalities that must be satisfied. Testing an 
approximate equality is in principle a methodologically different 
problem from the easier case of verifying or refuting an inequality. 
From this viewpoint it is desirable to reverse the usual relative 
treatment of a minimum and variational principle. In many 
discussions of classical mechanics, authors are satisfied to state 
a variational principle without being interested in whether the 
resulting conditions relate to “stationariness” or to the more 
difficult question of “definite minimization/' ’ From the standpoint 
of exact empirical refutation, it turns out to be conceptually 
much easier to test the hypothesis of definite minimization than 
to test the variational hypothesis. To repeat, this is because an 
inequality can be empirically refuted more easily than can an 
approximate equality. 

12. Let us consider various distinct equilibrium positions 
(tf 1 ; y l )> (a 2 ; y 2 ), (a 3 ; y z ), . . .. Then by definition of the regular 
minimum 

y (a& ...)- ^y)x) < Y (x\, x\, . . .) - £ y)x) 

1 1 
or 

m 

(22) y(x\, xi, ...)- y(2* 4 • • •) < 2 K - x ))y)> 

1 

showing that R' z in (19) is necessarily negative. Similarly for R' z '. 
Hence 

(23) 0 = [Y(x')-Y{x*)]+ [Y(*a)-Y(zi)] <i{x)-x*){y)-y))- 
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or, letting x) — x) = Ax Jt y) — y 2 = Ay j} we have holding 
between any two distinct equilibrium states 

(24) Ax x Ay x + Ax 2 Ay 2 + . . . + Ax n Ay n > 0. 

In the special case where only one variable of any admissible 
set of prescribed parameters is allowed to change, we get 

(25) 0 + . . . + 0 + Ax i Ay i + 0 + . . . + 0 > 0, 
showing that conjugate variables must change in the same 
direction when no change takes place in at least one of every 
other pair or conjugate variables. (This is a weak form of the 
so-called Le Chatelier principle.) 

If we have m observations, we can apply the above test (24) 
to each of the r(r — l)/2 pairs of observations. If any one of 
the inequalities fails to be satisfied, then the minimum hypo- 
thesis is refuted. However, if we have more than 2 observations, 
further independent inequalities must be satisfied. 

From (22), we have 

(26) (x) - *J) + £ y) (x% - a£) + £ (*5 - aj) > 0 

111 

= [Y(x')-Y(x 2 )] + [Y (x 2 ) - Y(x*)] + [ Y(x ») - Y(rf)] 

a test that every triad of distinct points must satisfy. Reversing 
the order of the points, we get a second independent inequality 
binding the same triad. 

Generally, for any circular chain of distinct points 
(x 1 ; 2 / 1 ), (: x 2 ; y 2 ), . . ., (x r ; y T ), (a r+1 ; y r+1 ) — (x 1 ; y 1 ), we have 
by similar reasoning 

(27) £ E y>\ - sf 1 )] > 0 = i [Y (*,) - Y(z t+1 )]. 

1=1 j i—X 

Reversing the order of the points from (1, 2, . . ., r, 1) to 
1 , r, . . ., 2, 1 gives a similar relation 

r-fl r 

(28) o > 2 - ^r 1 )] = - is 2 // +1 h - -r 1 )]. 

2 j 1 

By duality, we can infer relations just like (27) and (28), 
such as 

£ S A(v\ - $ -1 )] < °; 

i=l s 


( 29 ) 
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(29) can be derived directly from (28) by rearranging terms, and 
hence does not represent an independent condition. 

The number of independent inequalities is equal to the number 
of distinct circular chains that can be formed from the finite 
given number of observations, r. If r — 2, we have a single 
condition, namely (24). If r — 3, we can form 3 independent 
pairings, and hence we have three chains of length two to 
provide us with inequalities. In addition, we have two in- 
dependent chains of length three; viz. 123, 321. So altogether 
for r = 3, we have 5 independent inequalities to test. 

Given any k points, we can make N t = (k — 1)! circular 
chains involving all k elements. To prove this, note that N 2 — 1, 
because 12 and 21 are the same circular chain. Also, we go 
from a chain involving 12 ... k — 1 in some order to one in- 
volving 12 . . ., k — 1, k in some order by putting between any 
two adjacent numbers of the shorter chain the new element k. 
Since there are k such places to insert the new element, obviously 
there are k new chains of length k for each chain of length k — 1. 
Therefore, N k = kN k _ x , a recursion relation yielding, with 
N 2 — 1, the unique solution N k = (k — 1) ! 

The total number of independent inequalities that any r 
empirically observed points must satisfy is the sum of all possible 
chain-inequalities of length k ^ r, and therefore equals in number 

N,+ (s) w '+ ■ ■ ■+ Q w '" Q 1 + (3) 2 + ()) 31 + • • ■ 

+(,:,) fr- 2 ) i +(— i > 

For r = 2, 3, 4, 5, we already have 1, 5, 20, 74 independent 
inequalities to be satisfied, and the number rises rapidly with r. 
(Incidentally, there are the same number of Hicksian equalities, 
of the type given in (21) that must be approximately satisfied 
for nearby points.) 

This may all be summarized as follows. 

Theorem 6: Any r distinct observations (a; 1 ; y 1 ), . . .,(af; y T ) 
arising from a definite minimum system must satisfy 25 (p 
(&— 1)! independent inequalities of the type 
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* 

2 C^i^i + y*Ax 2 + ...]> o, k^r 

i = 1 

where the summation is over a circular chain consisting of k 
distinct points and returning to the original point and where 
the superscripts in y] and in x) — xj +1 = Ax } have been 
omitted. Similar, but non-independent, dual relations involving 
Ay and x can be written. 

13. Relations of the type given in Theorem 5 are necessary 
conditions: violation of any one of them could serve as a refuta- 
tion of the hypothesis that we are dealing with a regular minimum 
system. (If we drop regularity assumptions, equalities must be 
added to the inequalities but otherwise the theorem is still 
valid.) The question naturally arises as to whether there can be 
any further necessary conditions beyond the ones already de- 
rived. That is, are these necessary conditions in some sense also 
sufficient conditions for a regular minimum system ? The following 
answer can be given. 

Theorem 7: Given n smoothly differentiable relations 
holding between 2 n conjugate variables, namely y l —Y i (x 1 ,...,x n ), 
i = 1 , 2, . . ., n, and if there can never arise for any finite 
number of observations a violation of any of the inequalities 
of Theorem 6, we are then assured that the system is a regular 
minimum one, with symmetric positive definite Jacobian 
matrix and all the other properties of such a system. 

To prove this, consider a chain of m distinct points that he 
along any simple closed path or contour in the n dimensional 
(x v . . ., x n ) space, with 1 , 2, . . ., m being arrayed by conven- 
tion in counterclockwise order. Define S m = 2i’Ei yi( x i~~ x j +1 )l 
and s m = K — af 1 )]- B y ( 27 ) and ( 28 ) and our 

present hypothesis, it is to be true that S m is positive and s m 
is negative for any m. Now let the number of observations, m, 
along the same simple closed contour increase indefinitely in 
such a way as to make the distance between any two adjacent 
points go to zero in the limit. Then clearly from the definition 
of an integral as the limit of a sum, 

lim S m = f Ji/idx, = lim s m 

mr->oo J m—*co 
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where the indicated line integral is taken counterclockwise 
around the specified simple closed contour and where y s = Y s 
(x v . . x n ). Since this line integral is the limit of a sequence of 
positive numbers and at the same time is the limit of a sequence 
of negative numbers, its value is proved to be zero along 
any closed contour. Hence the expression '%y,dx j must be an 
exact differential dY (x v . . ., x n ), with y t = Y l (a;*, . . x n ) = 
Y { (x lt . . ., x n ) and Yj = Y\. This proves the symmetry or 
variational aspect of the problem. 

To prove the definiteness, we may consider the inequality 
that must hold between any pair of points; given our assumptions 
concerning differentiability, this may be written in the Taylor’s 
expansion 

0 < 2 dyjAXj = 22 Y ij Ax t Ax j 

i ii 

where not sllAXj are zero and where the second partial derivatives 
are evaluated at some point intermediate between the two 
points in question. Clearly at such a point the quadratic form 
made up of the Hessian matrix must be positive definite; and 
the same must be true of every observed point, else we could 
write down a contradiction to our finite inequality. This com- 
pletes the proof. 

The interpretation of this theorem is rather delicate. It shows 
that we can, by taking enough observations close enough 
together, always succeed in detecting any deviation of the facts 
from the regular minimum hypothesis. But it does not tell us 
how many observations will be necessary. 6 ) Therefore, no matter 
how many observations we may have examined, all of which 
satisfy all requisite inequalities, we cannot be sure but that 
still further observations might show a violation of the hypo- 
thesis. This is a common situation: methodologically, we can 

®) In Economica , February, 1953, p. 9 I posed the “open question” as to 
how many situations m might be needed to test the existence of an integrable 
field. Were n «= 2, we know that m — 2 would be sufficient. For n 3 is there 
a similar finite m — &(n) bound on the number of situations that might be 
needed to refute integrability? The question can now be closed by the Statement: 
when there is definitely non-mtegrability, the closer the conditions for inte- 
grability are to being fulfilled, the larger must be the m needed to reveal that 
non-integrability; so there is no a priori finite bound on m possible. 
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refute a hypothesis with a finite number of observations, but we 
can never strictly “confirm” it in this way. Furthermore, the 
theorem does not tell us that, for fixed m, satisfying all the 
prescribed inequalities is all that can be required of the ob- 
servations: i.e., it does not prove the non-existence of any further 
independent implications of the regular minimum hypothesis. 
I would conjecture, from various algebraic and geometric 
considerations, that no further independent inequalities can be 
prescribed; but this has not yet been proved. 

14. This completes the program of characterization of the 
necessary and sufficient conditions for a system to have the 
structure of a regular minimum. 7 ) A number of further problems 
may be briefly mentioned. First, the smoothness and uni- 
queness assumptions involved in the notion of “regularity” 
may be dropped, and investigations may be made of the resulting 
modifications in the theorems. With partial derivatives not 
being defined at sets of points, the Jacobian and Hessians of 
the first part of this paper will then not necessarily exist or be 

7 ) The rather ambiguous principle of Le Chatelier, in its correct form 

x\(yi y») S *{( 2 /! 2 t/n- 1 . *„) ^ ^ xEj( yi ; x 2 , ...,x n ) > 0 

can be derived from our conditions, as can the equivalent dual form which 
says that 2/JJ(2/ x , . . y m ; . . ., x n ) must be positive and non-decreasing with m. 
But adding the same finite positive Ay, to each (y v . . y m \ x m+1 , . . ., x n ), 
all of which correspond to the same initial equilibrium point, need not lead 
to the finite inequalities 

xl (yi+4yi, . . y n ) ^x 1 (y 1 +Ay l y„^; x„)> ... ^ a l {y 1 +Ay 1 ) x 2 , . . ., x„). 

For sufficiently small changes in y, these finite inequalities must hold, but the 
proof is not immediate. For finite moves of any size, it is easy to give counter- 
examples that are nonetheless perfectly regular. This may be regarded as 
slightly paradoxical since it says that from any point in the (pressure, volume) 
diagram, there will be an isothermal line “flatter” than an adiabatic line; 
yet, the finite change in pressure in a system when you change its volume 
holding temperature constant, can turn out to be greater than the change in 
pressure resulting from an equal change in volume with entropy being held 
constant. The paradox is disspelled if one realizes that 

My, 

J 0 { x VUi + t, .... y m i . . ., x„) - x\{y 1 + y m+1 ; ■ ■ ■, z n )}dt 

has an integral that starts out non-negative at t = 0 but which may become 
negative as the equilibrium points being compared become different ones. 
The last vestige of paradox disappears when we realize that regular systems 
need not admit of (p, v) as prescribable coordinates; hence, any point in this 
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invertable. Extensions of the Implicit Function Theorem and 
of the concept of one-to-one correspondence will lead to results, 
of which the regular systems constitute one special case. As far 
as the finite inequalities of the latter part of the paper are 
concerned, with the addition of equalities to the inequalities, 
they will all remain valid. 

A second, quite different extension of the present program is 
to apply it to the study of regular systems involving an in- 
finite number of variables or the equivalent. Thus, the differential 
equations of classical mechanics can be regarded as functional 
systems that are limiting cases involving an infinite number of 
variables. The question naturally arises as to an exhaustive 
summary of the necessary and sufficient conditions that the 
observable motions must satisfy if they are to have arisen 
from a regular variational system that is or is not a true mi nim um 
system. As far as I know, no one has yet carried through such 
a program in the full detail of the present examination of the 
simpler case of a finite number of functions of a finite number 
of variables. The present discussion shows some of the problems 
that will certainly be encountered in the more difficult realm of 
function spaces. 


diagram may correspond to more than one physical state of the system. 

A slight generalization of the Le Chatelier principle can be stated in terms 
of quadratic forms, namely 

r r rr 

SSaKVi Vm, ■■ : »»)M/ ^ ss x*(y v ...,y k ; ... x n )h,hf 

1 11 

where r ^ k < m. This is provable directly from the form of J n in (6) and its 
dual. 



APPENDIX 


Axiomatic Basis For Equilibrium Relations in Classical 
Thermodynamics. 


1. The formal mathematical analogy between classical thermo- 
dynamics and mathematic economic systems has now been 
explored. This does not warrant the commonly met attempt 
to find more exact analogies of physical magnitudes — such 
as entropy or energy — in the economic realm. Why should 
there be laws like the first or second laws of thermodynamics 
holding in the economic realm? Why should “utility” be literally 
identified with entropy, energy, or anything else? Why should 
a failure to make such a successful identification lead anyone 
to overlook or deny the mathematical isomorphism that does 
exist between minimum systems that arise in different dis- 
ciplines? 

In this Appendix, some of the special differences between the 
variables and relations that arise in thermodynamics and in 
economics will be briefly explored. This will involve giving a 
slightly unconventional axiomatic base for classical thermo- 
dynamics, or at least for that part which deals exclusively with 
equilibrium states and which might be called thermostatics. 1 ) 

2. While in economics the variables (x 0 y { ) will usually be 
observable quantities of goods and their respective prices, in 
thermodynamics they will not all be as immediately observable 
as is the case of volume and (negative) pressure. Thus, (x v y x ) 
might correspond to “entropy” and “absolute temperature,” 
which to early writers were not known, directly observable 
quantities. And a function like Y (a^, . . ., x n ), which in economics 
might be easily measured as dollar revenue, in thermodynamics 

2 ) Such an axiomatic excursion departs from the usual Clausius formulation, 
in term£ of the first and second laws. (Cf. also the work of Carnot, Kelvin, 
Gibbs, and Planck). It resembles a little the alternative axiomatic approach 
of C. Carath^odory, Mathematische Annalen, 67 (1909) pp. 355 — 86. See M. 
Born, Natural Philosophy of Cause and Chance (Oxford, 1949), Ch. V for a good 
account and for references. 
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might correspond to an internal energy function whose existence 
and properties have to be established by intricate reasoning. 

Fig. 1 plots the properties of a simple system whose states 
can, for simplicity, be assumed to be determined by pressure 
and volume (p, v). The solid contours represent loci of equal 
“temperature” and as yet have no natural, preferred numbering 
on them corresponding to absolute temperature, t. Instead 
T = T(v, p) represents any monotone numbering; and any 
renumbering T — f(T) = T'(v, p), with / an arbitrary mono- 
tone-increasing function, would be an equally good indicator 
of greater and smaller temperatures. 

The broken contours represent "adiabatics” and represent 
loci of points that could be observed if the system were "in- 
sulated” from its environment by perfect non-conducting walls. 
I omit a good deal of explanatory matter needed to establish 
the exact nature of these two sets of contours, instead taking 
them as the given primitive concepts of the axiomatic system. 
Note that it would be premature to call the adiabatics “isen- 
tropes” since we do not yet have any entropy magnitude to be 
held constant. Yet we can give an arbitrary numbering to the 
adiabatics, written as S = S(v, p) or as S' = F(S) = S'(v, p), 
where again F is an arbitrary monotone-increasing function 
which will preserve the direction of the ordering. 

3. The first task of our thermodynamic system is to define 
its own “canonical entropy and temperature” so that we can 
go from the arbitrary scaling of T and S to a privileged canonical 
temperature t and canonical entropy s; when this is done we shall 
be able to define an internal energy function e = e(v, p) or 
e = E(s, v). 

Note that Fig. 1 is drawn with the special property of “pro- 
portional areas.” Thus the shaded area A is to B as the area 
A' is to B'. And the same equal proportionality would be true 
of the appropriate curved parallelograms formed by any four 
thermal contours intersecting with any four adiabatics. 

This equal proportionality of areas is not an accident’, it is a 
fundamental regularity of nature, from which we can deduce the 
existence of an internal energy, a canonical temperature and 
canonical entropy. 
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4. In fact pick any thermal contour aa as the arbitrary origin 
for temperature and any other thermal contour bb as the arbi- 
trary t = 1 level which will set the scale of one unit of canonical 



i.o v 

Fig. 1 


temperature. We can now give a determinate number t to any 
arbitrarily given thermal contour by making its t the ratio of 
the algebraic area that it and any pair of adiabatics form with 
aa relative to the area formed by the unit temperature contour. 
Fig. 2. shows the resulting canonical temperature scaling for 
each contour. (Verify that cc corresponds to t = 1.5 because 
the area C + B is one and a half times as great as the area C 
alone; of course (C' + B')/B f gives the same ratio. And dd 
corresponds to t = —2.0 because the area A below aa is twice 
the absolute size of the area C.) 

5. Having defined canonical t, we have no further use for 
T or T\ And we can now go from S or S' to canonical s, called 
entropy. This may be measured from any point selected as an 
arbitrary origin: thus the point northwest of A may be taken 
as zefo origin, although it is to be understood that this entropy 
origin would not have to be at a point where t = 0. 

Once the dimension of pressure is selected, the scale of s is 
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not arbitrary. Suppose /5/3 has been selected just the proper 
distance from aa as to make the area C (which has the dimension 
of pressure times volume) be exactly unity in area. Then /3/S is 
given the designation s = 1.0 and every other adiabatic is 
given a unique s determined by the algebraic value of the 
area which it makes with aa, bb, and aa. (If the adiabatic is 
to the left of aa we give it a negative entropy with magnitude 
proportional to appropriate absolute area. See Fig. 2 for il- 
lustrative s values.) 



Fig. 2 

6. Now that we have defined the functions t = t(v, p) and 
$ = s(v, p), we can verify that the equal proportionality of area 
property implies that the line integral 
(Al) t(v, p) ds(v, p) — pdv 

must be independent of path and expressible as the exact dif- 
ferential de(v, p) of an existent “canonical internal energy” 
function & — e(v, p), which is arbitrary in its origin because of 
the constant of integration. 

Because the canonical temperature t is arbitrary in its origin, 
transforming from t to t + t 0 , would introduce an arbitrary linear 
entropy term t 0 (s — s 0 ) in our expression for canonical energy. 
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Within the realm of thermostatics, where we merely observe 
equilibrium states and make no measurements of “irreversible 
processes/ * we can never hope to remove this ambiguity in the 
definition of canonical internal energy. 2 ) We can never hope 
to deduce the level of absolute zero, which characterizes the 
Kelvin scale. We cannot define the “efficiency” of an hypo- 
thetical Carnot cycle from knowledge of the isotherms and adia- 
batics alone. (In fact the present treatment never explicitly 
mentions “heat,” inexact differentials lik edQ, or exact differen- 
tials like dQ/t.) 

7. Under our simplifying assumption that (v, p) are admissible 
state variables, we can rewrite internal energy ^ as a function 
of the pair of variables (s, v) to get E(s , v) such that 

(A 2) dE(s , v) = t(s, v)ds — p(s, v)dv . 

Here E(s, v) corresponds precisely to my earlier Y(x v x 2 , . . .); 
and it would be easy to show that the Gibbs potential 
e — ts —{—p)v, written as a function of t and —p, corresponds 
to the dual X(y v y 2> . . .). Corresponding to P{y 1 \ x 2f . . .) would 
be the Helmholtz free energy e — ts written as a function of 
t and v . 

8. All of the above implications of the equal-proportional 
areas axiom can be given brief mathematical summary 3 ) in the 
case where all the functions have nice (overly strong) differen- 
tiability and regularity properties: in particular where any two 
T(s, p) and S(v, p) contours intersect in one and only one point 
so that these functions can be inverted to give v = v (S, T) 
and p = p(S, T) with smooth partial derivatives. 

Suppose we know that indicators of temperature and entropy 
can be converted by determinate (save for arbitrary origin 
constants and scale constants) functions t == f{T) and s = F(S) 
to yield canonical temperature and energy; and that there exists 
a canonical internal energy function e = E($ t v) with the 
Maxwell reciprocity or integrability property 


*) TRis was noted by Carath^odory, op. cit., p. 381. See also A. Land£, 
Handbuch der Physik, Band IX, 281 — 300 for a treatment related to the 
present one. 

3 ) This and the next section can easily be skipped. 
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(A3) 


d 2 E(s, y) _ g 2 £(s, v) 
dvds dsdv 


From the form of the exact differential in (A2), this is equivalent 
to 


(A4) 


dt(s, v) _ dp(x, v) . 

dv ds 


which in terms of arbitrary T and S and f(T) and F (S) becomes 


(A5) 



f'(T)F'(S). 


Now the left-hand side of this relation is a perfectly observable 
magnitude, being a function of (v, p) or (S, T) or any other 
two variables. Let us call it J. The equal-proportional area 
property — which is an integrability property in the large — 
is equivalent to saying that this observable left-hand expression 
J is truly a product of two separate functions. I.e., necessarily 
log J must be the sum of a function of T and a function of S, 
so that 


(A6) 


3 2 log J(S, T) 

dSBT 


is a necessary and sufficient condition for the equal-proportional 
area property to hold. 

An alternative functional equation which J must satisfy is 


(A7) 


J(S» T x ) J(S 2 , 7\) = 

/(Si. T 2 ) J(S 2 , T 2 ) 


for all (S lt T x ) and (S 2 , T 2 ). 

Both (A6) and (A7) are observable conditions which could 
be empirically verified. 

Incidentally, by some manipulations of implicit fiftiction 
theory, it is easy to show that J is a certain Jacobian determinant, 
namely 
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(AS) 


(dp\ 

\dS/ v 

1 


dv dv 

BSdT 

® “ 

dSdS 


dp dp 

\dv/ s 

dv dp 


dSdT 


dT dT 




dv dp 




d(v, p) 


d(S, T) 


>/(S. T). 


9. Given the observable function J(S, T), we can derive 
canonical temperature and entropy by simple integrations. Thus 


(A9) 


t = f{T)=t 0 +^ f'(r)dr 

, , , /T ,r /(So. t) , 

= *0 + / r o 777 — 

Jt o J{S 0 , T 0 ) 


where £ 0 = f[T 0 ) is an arbitrary origin constant and f'[T 0 ) is 
an arbitrary scale constant. Also 


(A10) 


s — F(S) = s 0 + 




J(°> T 0 ) 
f'(T 0 ) 


da 


Because J has the remarkable multiplicative property, this 
f(T) integral will be independent of the S Q level and (AlO)'s 
F (S) integral will necessarily be independent of the T 0 level. Note 
that the origin of s is arbitrary, but the scale is definitely not. 

Since the line integral ids — fdv is independent of path, we 
can by a variety of alternative integrations calculate the canonical 
internal energy function E(s, v), which is determinate except for 
an arbitrary linear term t 0 (s — s 0 ). 

10. So far the discussion dealt with simple systems with only 
two degrees of freedom: e.g. a single homogeneous fluid. The 
existence of an absolute or Kelvin temperature which is the same 
for all substances shows that the equal-proportional area law 
must apply also to the interrelations of different bodies. Thus 
Fig. 3 shows that if two bodies are in respective equilibrium 
at t = 0 and at t = 1, then we can separately for each calculate 
the canonical temperatures = \ and t % = \ by our previously 
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described techniques of canonical temperatures. Then we can 
confidently predict — in advance of ever having made the 
experiment! — that the = \ and t 2 = \ contours will truly 
represent the same temperature and represent mutual equi- 
librium. 



Fig. 3 


This requires that the area relations AJB-l = A 2 /B 2 hold 
between different substances. (In fact this is the generalized form 
of our equal-proportional area axiom, from which all the other 
relations of thermostatics follow.) 

11. There are far-reaching implications of the axiom. Thus, 
suppose two (or more) bodies are separated by diathermous 
walls (i.e. walls which require the bodies’ mutual equilibrium 
to be at equal temperatures) and that these walls are also elastic 
(i.e. such as to require equal pressures at equilibrium). Then 

(All) ^ = ^ »a) = • • • = * = /CO 

V l) = Pz( S 2> V%) ~ = ft- 

But in terms of the internal energies (s 4 -, yj.^this means 


(A12) 


ggife, ^i) (s 2 > v 2 ) 

os 2 

9-Ei(%, %) _ SE 2 (s 2> v 2 ) 

dv-L 


* 


dv, 


=~P. 
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which in turn is seen to be the necessary condition that 
e = *1 + H + • • • = E i( s i> Vi) + e z( s 2 > ®a) + • • • be at a min- 
imum subject to 

5 1 + 5 2 + . . . = S 

v x + v 2 + . . . = v; 

and that the resulting minimized energy be a function of the 
total s and v respectively — namely 

minimum e = E(s x + s 2 + . . v x + v 2 + . . .)=E(s, v) with 


(A13) 


(A14) 


dE(S, v) 
ds 


= t. 


dE(s, v) 
dv 


-P, 


where again an arbitrary linear term in total entropy t 0 (s — s 0 ) 
will be involved. 4 ) 

For a true maximum, it is sufficient that each E i [v t , s ( ) have 
a positive definite Hessian matrix of second partial derivatives 

-d*E< d*Er 

d sf dv { ds { 
d 2 E i d 2 Ej ’ 

_ds i dv i dV( _ 

and then it will follow that the Hessian matrix of E(s, v) is 
positive definite. 40 ) 

4 ) If the bodies are in thermal contact but separated by rigid walls, the 
pressure equality of (A12) is lost and the volumes (v lt v 2 , . • *) are separately 
specifiable: we then get e — e x -f e 2 + • • • = E(s 1 + v v v 2 • . •) in 

minimized form, with dE/dVi — —p t and dEjds — t. 

4o ) Many treatises on thermodynamics seem to be saying that E(s, v) is at 
a minimum for prescribed s and v. Actually li(s, v) is a determinate function 
of its arguments and is not free to be at a maximum or a minimum. It is the 
sum E 1 (s v vj -j- E 2 (s 2 , z; 2 ) which has its (s { , varied subject to fixed + s 2 
and v x + v 2 until E x + is at a minimum. When the resulting optimal ( s it v t ) 
are substituted into E x {s x , v x ) + E 2 (s 2 , v 2 ) the result defines E(s, v). 

That the Gibbs free energy or potential is at a minimum for fixed t and p 
likewise requires careful statement. Write the expression e — ts — (-p)v, 
regarded as a function of t and p alone, as G{t, p). Then G(t, p) cannot be at a 
minimum for arbitrarily prescribed (t, p), nor at a maximum, nor anything 
else but at its determinate value. Rather it is G(s , v ; t,p)=*E (s, v) — ts — {~p)v 
that is at a minimum with respect to (s, v) for prescribed (t, p). This can be shown 
to be a special case of the condition that E x + E 2 is at a minimum for prescribed 
v x -f v 2 and s x + s 2 . Think of 5, v and E as belonging to one body, and hence 
write them as s x , v x , E x (s x , v x ). Now consider a second body so large compared 
to the first that its temperature and pressure will be affected negligibly by 
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12. Note that in contrast to the Carath4odory-Bom treat- 
ment, there was here no stipulation of any integrability con- 
ditions or inaccessibility conditions other than that of equal- 
proportional area. Once the T t = T i (v v p t ) and S { = Si(v it p t ) 
contours were specified and the equal-proportional area axiom 
of a single uniform canonical temperature was specified, then 
all the remaining relations of thermostatics logically followed. 

13. Instead of minimizing total energy e for given total 
entropy s and total volume, with t > 0 we could have maximized 
entropy for given total energy and volume. Leaving thermostatics 
for true thermodynamics and actually observing the time 
changes of a given isolated system, we would actually find that 
total energy would remain constant and total entropy would 
increase in time. But within the realm of thermostatics — i.e. 
the study of equilibrium relations like (All) — we would have 
no knowledge of this or interest in it. 

14. The present formulation 5 ) thus, seems to have turned 
the usual formulation upside down. Rather than begin with 
the first and second laws of thermodynamics and then deduce 
equilibrium relations and the existence of a universal absolute 
temperature, we have assumed a universal canonical temperature 
(with arbitrary origin) and have then deduced equilibrium 
relations and integrability conditions. We have assumed less 
than the first and second laws: and we deduce less. We deduce 
only thermostatic relations, deducing almost nothing about 
irreversible processes. (In the original definition of the adia- 
batics S { = S { (v { , p { ) there does lurk in the background an 

the first. (A “heat and pressure bath.") Write its energy as E 2 (s 2t v 2 ) =. 
f$2 — where l and p are fixed parameters. Now minimize E 1 (s 1 , ^ i ) + 

E 2 {s 2 , v z) subject to fixed s = s ± + s 2 and v — v t -f v 2 to get 
Minimum E x (s v v x ) + E 2 {s 2 , v 2 ) = [E x (s lf Vj) — ts x - \-p)v{\ + Is + 

= G ± (.%, t, p) + a constant. 

This last expression shows that minimizing the Gibbs G 1 is a special case of 
minimizing E x -f- E if the total of the observed system and its “environment.” 

6 ) The equahproportional area property on which all this is based was 
suggested to me by an obvious extension of the reasoning of Maxwells Theory 
of Heat (1871 edition), but I have not seen it explicitly alluded to in any 
twentieth century book. It is, however, so obvious and basic a property that it 
must have been rediscovered many times. 
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implicit reference to irreversible processes and two-dimensional 
“inaccessible neighboring states.” But once the primitive notion 
of an adiabatic is assumed, all the rest is thermostatics.) 

The Gibbs thermodynamics of heterogeneous substances has 
not been here discussed. But what was done here for — J - fdv 
areas could also be done for — J f^idMf areas, where u, i represents 
chemical potentials and M t chemical masses; and it would seem 
that no new mathematical problems would be raised. 
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Decentralization and Computation 
in Resource Allocation 

I. FORMULATION OF THE PROBLEM 
A. Introduction. 

In this paper, we wish to discuss the bearing of some recent 
developments in mathematical economics on the problem of the 
optimal allocation of resources. We will confine attention here 
to an economy whose aims are well defined. That is, we assume 
that the preferences of the economic system can be embodied 
in a utility function which depends upon the outputs of com- 
modities. For a given technology, the possibilities of different 
output combinations are restricted by the availabilities of 
primary resources. The problem of optimal resource allocation 
is to choose among all the feasible combinations of production 
processes that combination which maximizes the utility achieved 
by the economy. 

Since the discussion is at a fairly high level of abstraction, the 
economy being studied may be a nation or some smaller economic 
system, including a single firm. The assumption that a single 
utility function represents the objectives of the economy fits 
best the case of a firm. For a nation, the assumption is less 
justified, but it provides an introduction, at least, to the more 
complex problem raised by the presence of many individuals, 
each of whom judges the workings of the economic system in 
light of his own utility function. We also avoid the subtle 
problems involved in defining optimality in the more general case. 

The problem of choosing the allocation of primary resources 

* The research reported on in this paper was to a large extent carried on 
under the auspices of the RAND Corporation. We also wish to thank the 
Office of Naval Research and the Center for Advanced Study in the Behavioral 
Sciences for their assistance and M. J. Beckmann and W. M. Gormon for their 
comments. 
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among different productive processes so as to maximize a 
prescribed utility function is a mathematical one, and its 
solution in any concrete case can be regarded as a matter of 
computation. Now any computation, beyond the very simplest, 
involves a process of successive approximations, that is, a process 
carried out in several steps, where the calculations in each step 
make use of the results of the preceding steps and one of the 
results of each step is an approximation to the desired answer. 
(The ordinary process of long division is an example of a process 
of successive approximations; the results of each trial division 
step are another digit to be added to the quotient and a new 
dividend to be used in the next trial division.) 

That the market place solves the economic problem of equating 
supply and demand by successive approximations to the equi- 
librating price or prices is a concept familiar from elementary 
economics textbooks. In a single market, each approximation 
results in naming a price and calculating the difference between 
demand and supply at that price; the next approximation in- 
volves adjusting the previous trial price in a manner governed 
by this difference, with the idea of ultimately wiping it out. 
The notion of this kind of dynamics in the movement of prices, 
has its roots in the English classical economists, perhaps par- 
ticularly John Stuart Mill, but it received its first explicit 
recognition, particularly with reference to simultaneous succes- 
sive approximations to all prices, in the concept of tatonnements 
of L<§on Walras. 1 ) 

Since welfare economics assures us that under certain as- 
sumptions as to the utility function and the productive process 
(see part III, section A below for a more explicit discussion of 
these assumptions) a competitive equilibrium can be identified 
with an economic optimum, we may conclude that the method 
of successive approximations which solves the problem of market 

1 ) See L. Walras, Elements of Pure Economics, ed. and tr. by W. Jaffe, , 
(London: George Allen and Unwin, 1954), pp. 84 — 86, 90 — 91, 105 — 6, 169 — 172, 
243 — 54, and 184 — 95. For a vigorous and stimulating analysis of Walras's . 
theory o ftatdnnements and its central function in seeking to show that economic 
equilibrium is solved in the market by successive approximations, see D. Patinkin, . 
Money , Interest, and Prices (Evanston, Illinois, and White Plains, New York:. 
Row, Peterson and Company, 1956), pp. 377 — 85. 
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equilibrium is also a computational method for solving the 
problem of optimal resource allocation. Indeed, it was seen in 
precisely this light by Pareto 2 ), who compared the market 
to a computing machine. Of course, as soon as the problem of 
optimal resource allocation is formulated as the solution of a 
system of simultaneous equations (namely, the equations 
defining competitive equilibrium), the possibility arises of solving 
them by some centralized procedure involving the use of com- 
puting machines rather than the market. Pareto objected that 
the enormous number of equations made such a procedure 
impossible. A completely centralized organization would require 
a capacity for storage and processing of technological and other 
information that exceeds anything likely to be available. The 
competitive process, on the contrary, achieves decentralization . 3 ) 
At each stage in the market's process of successive approxima- 
tions, any individual firm adjusts its tentative production plans 
making use of information only about the current tentative 
prices and its own technology. The adjustments of tentative 
prices, at the same time, depend only on the aggregate demands 
and supplies. These are simply a sum of the tentative production 
plans of the individual firms (and consumption plans of con- 
sumers) plus the originally existing supplies of basic resources. 

Thus the information needed by firms and consumers consists 
solely of their technologies or utility functions plus prices, while 
the adjustment of prices is based only on the aggregate of in- 
dividuals' decisions. It is the minimization of information 
requirements for each participant in the economy which con- 
stitutes the virtue of decentralization. 

Our aim here is to state more precisely than hitherto the 

a ) V. Pareto, Manuel d’liconomie Politique (Deuxi&me fCdition, Paris: Marcel 
Giard 1927), pp. 233 — 34. In the terminology of cybernetics, Pareto, crystal- 
lizing the usual views of economists, has held that the market mechanism is 
homeostatic, a position strongly denied by the founder of cybernetics; see N. 
Wiener, Cybernetics (New York and Paris: John Wiley and Sons, and Hermann 
et Cie., 1948), pp. 185—86. The comparison between the market and a com- 
puting machine has also been made by R. M. Goodwin, “Iteration, Automatic 
Computers, and Economic Dynamics/* Metroeconomica, III (1951), B — 7. 

8 ) For a possible definition of the term “decentralization** in this context, 
see L. Hurwicz, “Decentralized Resource Allocation,** Cowles Commission 
Discussion Paper No. 2112, 1955. See also the first paragraph of p. 77 below. 
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dynamic system which is implied by the market mechanism, 
to give conditions under which the resource allocation determined 
by it converges to the optimal, where optimality is defined in 
terms of a single utility function for the economy, and to suggest 
modifications of the market mechanism which still preserve 
some degree of decentralization for cases where the conditions 
in question are not satisfied. The conditions for convergence 
of the unmodified market mechanism are basically those of 
diminishing or constant returns in production and diminishing 
marginal utility (in a generalized sense) for the consumption of 
final demands. To study these dynamic problems, we will make 
use of a variety of mathematical tools, some of which have arisen 
in the theory of games and of linear and nonlinear programming 
and some of which are more classical applications of differential 
equations to maximization problems. 

In section B of this part, we will review the history of the 
problem of dynamic adjustment to a social optimum from its 
initial formulation by Walras. In section C, we will state more 
precisely our resource allocation model. In part II, the mathe- 
matical tools that will be used are reviewed; they are primarily 
the gradient method (or method of steepest ascent), which is 
a system of differential equations for solving a maximization 
problem, and the relation between constrained maxima and saddle 
points. In part III, we study the case of production under 
diminishing or constant returns with a strictly concave utility 
function (see Definition 1 below; the condition is a generalized 
form of diminishing marginal utility). In this case two dynamic 
systems which formalize the intuitive notion of a market 
mechanism have been shown to converge. In part IV, we 
consider the cases where unmodified market processes do not 
converge, in particular, that of increasing returns. Here three 
modified market mechanisms are proposed and shown to have 
some desirable properties. One such mechanism is closely related 
to imperfect competition, another to speculation. It is also shown 
that the purely linear case (linear utility function and constant 
returns* to scale), for which the unmodified market mechanisms 
do not converge, can be completely solved by some of the 
modified mechanisms. 
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B. Historical Remarks. 

Most of the discussions in the economic literature on the 
achievement of a social optimum through the market, whether 
in a socialist or a capitalist economy, have contented themselves 
with a static characterization. 4 ) The main contributors to a 
dynamic formulation of the market mechanism have been Walras, 
Pareto, Taylor 5 ,) Lange, 6 ) and Samuelson. 7 ) 

In the present study, we are concerned with a single consumer 
in whose interests the whole economy is run. Walras, Pareto, 
Taylor, and Samuelson are all concerned with a multiplicity of 
consumers; Lange deals with both cases. In the many-consumer 
case, it is assumed that the consumer maximizes his utility 
instantaneously, so that his demand for commodities is a given 
function of prices. 

Walras was not explicitly concerned with a socialist economy, 
but he did regard the competitive system as a computing device 
for achieving a maximum of satisfaction to society. On the 
production side, he assumed that all production processes were 
linear-homogeneous. 8 ) In this case, the profit is proportional 
to the scale, at any given set of prices, and the marginal profita- 
bility is simply a constant. Walras’ rule for adjustment when the 

4 ) See for example A. P. Lerner, The Economics of Control (New York: 
Macmillan, 1946) or J.E. Meade, Planning and the Price Mechanism (London: 
George Allen and Unwin, 1948). 

5 ) F. M. Taylor, “The Guidance of Production in a Socialist State,” American 
Economic Review, Vol. 19 (1929), No. 1, reprinted in B. Lippincott (ed.), On 
the Economic Theory of Socialism (Minneapolis: University of Minnesota Press, 
1938), pp. 41 — 54, particularly pp. 50—54. 

6 ) O. Lange, “On the Economic Theory of Socialism,” Review of Economic 
Studies, IV, Nos. 1 and 2 (1936-7) reprinted in B. Lippincott, ibid., pp. 57 — 142, 
particularly pp. 70—98. 

7 ) P. A. Samuelson, Foundations of Economic Analysis (Cambridge, Mass.: 
Harvard University Press, 1947) pp. 269 — 75. 

8 ) We follow the discussion in Walras, op. cit.. Lesson 22, which treats the 
adjustment process when there are both consumers and producers. In this 
lesson, it is also assumed that there is only one output for each process and 
only one process for producing each commodity (i.e., the assumption of fixed 
production coefficients), but this assumption is irrelevant to our purposes. 
Later, Walras considers the production coefficients not as given but a$ derived 
by minimization of costs with a given production function (pp. 383 — 86); 
but he never successfully integrates this discussion with the earlier discussion 
of adjustment, as Jaffe points out (pp. 552 — 53). 
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economic system is in an initial state of disequilibrium is that 

(a) the price change has the same sign as the excess demand, and 

(b) the change in the scale of each process has the same sign as 
the marginal profitability. 9 ) 

Walras’ adjustment rule for firms is an inevitable consequence 
of his assumption of constant returns to scale. We cannot 
postulate that a firm will instantaneously maximize its profits 
at any given level of prices; under constant returns, the profit- 
maximizing scale may be infinite, if positive profits are possible 
at some level, or it may be that there are zero profits at all 
scales, in which case profit-maximization does not define the 
behavior of the firm, or, if profits are negative at all positive 
scales, the optimal scale is zero. Walras correctly saw that 
under these circumstances he could not prescribe instantaneous- 
profit-maximization in the way he did require instantaneous 
utility-maximization by the consumer. Despite the necessary 
character of lagged adjustment, Walras’ successors were usually 
not so careful, and a clearer understanding of the special adjust- 
ment problems connected with constant returns to scales was 
not achieved until the development of game theory and linear 
programming brought them to the fore. 

Pareto explicitly noted that a socialist system would have to 
mimic the process of competition to achieve an optimal alio- 

9 ) Walras, op. cit., pp. 253 — 54; the terminology has been changed to conform 
to that used below. The reader should be warned that Walras' presentation 
is by no means unequivocal, and the interpretation is not the only one possible. 
We have followed the statement which summarizes the lesson (22) on tatdn- 
nements in an economy with consumption and production. But the preceding 
ten pages, if taken literally, present an adjustment process only distantly 
related to the summary. Some prices are held constant while others vary 
independently to clear different markets; similarly quantities are adjusted to 
make profits zero in each industry, but the adjustment process described takes 
all prices other than the selling price as given. It is hard to believe that Walras 
meant to describe reality as if markets and firms came into equilibrium in a 
preassigned order. It is perhaps this difficulty which led Goodwin (op. cit., 
p. 5) to deny that Walras meant his adjustment process to be practical, as 
describing either reality or a device for the operation of a socialist society. 
However we feel that Walras suffered in his exposition from the crudity of his 
mathenflatical tools; he was seeking to explain a simultaneous adjustment in 
many markets and within many firms without using the concept of a system 
of differential (or difference) equations and hence was forced to resort to a crude 
formulation in which some variables changed while others are held constant. 
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cation 10 ) and, following Walras, laid great emphasis on the 
market as a computing device to solve the system of equations 
of general economic equilibrium. However, his description of the 
dynamics of adjustment is considerably less precise than Walras'. 
In the Cows (published in 1896) he indicated rather sketchily 
his agreement with Walras' description of the competitive 
process 11 ) and, while agreeing with Edgeworth that computa- 
tionally there are many ways of achieving an economic optimum, 
contends that Walras' way is the natural economic way. 12 ) 

The description in the later Manuel is more extensive but more 
obscure. The discussion of stability implies a dynamic system in 
which price responds to a difference of supply and demand. 13 ) 
However, the meaning of the supply functions for firms is never 
clearly defined. For those operating under diminishing returns, 
it is the ordinary profit-maximization rule; for others, it appears 
rather to be the rule that price equals average cost. 14 ) The 
justification for the latter however is based on free entry, which 
is itself a dynamic process akin to Walras', and should have been 
introduced explicitly. 

Taylor distinguished between primary factors and produced 
commodities. For any given set of prices of the former, the price 
of the latter are set so that price equals cost of production. 15 ) 
The demand by consumers at these prices indirectly generates 
a demand for primary factors. The prices of these are then 
increased if demand exceeds supply, decreased in the opposite 
case. 16 ) 

Taylor's rule for price-setting by producers is anything but 
clear. In the special case where constant returns to scale prevail, 
there is no joint production, and each firm produces a final 
product directly from primary factors, Taylor's rule is unequi- 

10 ) Manuel , op. cit., pp. 362 — 64. There is an amusing misprint, where he 
speaks of a collectivist society “qui ait pour but de procurer a ses membres 
le minimum d’ophelimit^” (p. 362, italics added). 

11 ) Cours d’Sconomie politique (Tome Premier, Lausanne, Paris, and Leipzig: 
F. Rouge, Pichon, and Duncker and Humblot, 1896), pp. 45—47. 

12 ) Ibid., p. 25. 

ls ) Manuel, pp. 223 — 24, 232 — 33. 

14 ) Ibid., pp. 177—79, 185—87. 

15 ) Taylor, op. cit., p. 45. 

16 ) Taylor, op. cit., p. 53. 
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vocal. However, in the absence of constant returns, average costs 
will depend upon a system of simultaneous equations involving 
the demand functions among others. Even with constant returns, 
the setting of prices by Taylor’s rule require solution of a system of 
simultaneous equations if products of some firms are used by 
others. If there are alternative processes for producing the 
same commodity, there would also be a minimization problem 
to solve. 17 ) No indication is given as to the solution of these 
problems; certainly there would have to be some degree of 
centralization, in the sense that a central productive agency 
would have to have access to all the technical coefficients, at 
least, in order to determine prices. 

Lange has been the strongest defender of the proposition that 
a socialist economy can achieve both an optimal allocation of 
resources and the computational and informational virtues of 
decentralization. He presents two adjustment models, ac- 
cording as there is or is not consumers’ sovereignty. In both 
cases, the behavior of firms is defined by the principle of marginal- 
cost pricing. That is, at given prices, the firm is supposed first 
to find for any given output the minimum cost of producing it. 
The output is then determined so as to equate marginal cost, so 
calculated, to price. The demand functions for factors are 
determined by the cost-minimization criterion. 

This formulation is clearly designed to encompass the cases of 
both increasing and decreasing returns. In the latter case, it 
corresponds in general to the single rule of choosing inputs and 
outputs so as to maximize profits. As we shall see below , 18 the 
rule is inadequate and strictly speaking incorrect for the case of 
increasing returns. Further, the rule clearly does not meet the 
problems raised by the case of constant returns, as sketched 
above. 

17 ) If there is no joint production, then the efficient choice of processes will 
be the same for all demand functions. See T. C. Koopmans, ed., Activity Analysis 
of Production and Allocation , Cowles Commission Monograph No. 13 (New 
York and London: John Wiley & Sons and Chapman & Hall, 1951), Chs. 
VII — X? pp. 142 — 73. If there is joint production, then the determination 
of the optimal set of process and therefore of prices according to Taylor’s rule 
cannot be made independently of the demand function. 

18 ) See part IV, section C. 
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The firm's behavior defines the supply and demand functions 
of firms. In Lange's first model, the demand functions of con- 
sumers are defined by utility maximization. The supply and 
demand functions for the whole market now being defined, the 
state is to vary prices in accordance with supply and demand. 
Lange recognized that it had not been proved that this dynamic 
process would necessarily converge to the equilibrium which 
corresponded to optimal resource allocation; the process might 
not converge to anything at all but instead oscillate indefinitely 
or even diverge explosively. 19 ) He suggests that the price- 
adjustment rules of the state might have to be modified in some 
way to avoid oscillations, as by taking account of the anticipated 
effects of price changes on quantities, a point which is illustrated 
below for the case of increasing returns. 20 ) 

Lange's discussion of the second model, where there is only 
one utility function, that of a Central Planning Board, does not 
clearly define the dynamic process. 21 ) The meaning seems to 
be that the Central Planning Board determines its demand 
functions from its utility function as if it were a consumer and 
then prices respond to supply and demand, as before. It is not 
clear whether the Board is subjected to a budget limitation or 
it simply equates marginal utility to price. 

Samuelson's well-known restatement of the Walrasian dynamic 
system assumes that supply and demand functions are well- 
defined. His simultaneous dynamic system is then simply the 
statement that the rate of change of each price is proportional 
to the difference between demand and supply. Though the 
essentials of the system are found in Walras and Lange, it is 
Samuelson who has first made it explicit, and doubtless our 
reading of the first two is done through the spectacles supplied 
by Samuelson. 

The development of linear programming put renewed em- 
phasis on the problems raised by constant returns in production. 
In a purely linear economy, where each process operates under 
constant returns and the utility function of the economic system 

19 ) Lange, op. cit., in 43, pp. 89 — 90. 

20 ) See part IV, paits D, E, and F. 

21 ) Lange, op. cit., pp. 90 — 93. 
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is linear, the system of computations analogous to the market 
led to indefinite oscillations, as was shown by Samuelson. 22 ) 
Indeed, this problem was the starting point of the present 
investigation for the authors. 

C. Formal Statement of the Resource Allocation Model . 

Let there be s commodities and m processes for carrying on 
production of commodities. Each process may be carried on 
at different scales, and it is not supposed in general that there 
are constant returns to scale. Let x i be the scale of the j th process; 
for most purposes, x j may most conveniently be thought of as 
the amount of output of the j th process, or an index of outputs 
if the process has more than one. Let g i0 (xj) be the amount of 
commodity i produced by the j th process when the latter is 
conducted at scale x s \ a negative value for refers to an 

input. 

Among the s commodities we will distinguish a sub-class of 
n desired commodities which enter into final uses. The remaining 
or 'primary commodities are useful only because they enter into 
the production of the desired commodities directly or indirectly. 
Let y t be the amount of desired commodity i used by the economy 
for final demands — that is, not used up in one of the productive 
processes. The economy is assumed to possess a single utility 
function which has as variables the final demands for the 
desired commodities — that is, a function, 

(1) U{y v . . .,y n ). 

The total output of commodity i by the productive processes is 
25=i I* 1 addition, some commodities (particularly natural 

resources and labor) are available in positive quantities without 
production. Let be the amount of commodity i available 
initially; will, of course, be zero for most commodities. A 
primary commodity for which f < = 0 is usually referred to as 
an intermediate commodity, but we shall not need to make a 

22 ) 13 an unpublished memorandum of 1950 ; see R. Dorfman, P. A. Samuelson, 
and R. Solow, Linear Programming and Economic Analysis (New York, Toronto, 
and London: McGraw-Hill, 1958), fn. 1, p. 63. An example is given in part II, 
section D. 



44 ESSAYS IN ECONOMICS AND ECONOMETRICS 

distinction between primary and intermediate commodities. 
In order for a given set of final demands y lt . . ., y n to be feasible, 
it is necessary that they do not exceed the total available from the 
productive sector plus the initial availabilities; it is also necessary 
that the total output of the primary commodities plus that 
initially available be at least zero. Hence, 


m 


(2.1) 

Vt&IiM + ti (*' = i. • ■ 

1 

m 

., n), 

(2.2) 

0 ^ 2 St A x i) + (i = n + 1, . 

3—1 

■ s). 


Here we designate the desired commodities as 1 , . . ., n and the 
primary commodities as n — 1, . . ., s. The resource allocation 
problem is then to choose y v . . y n and x v . . ., x m so as to 
maximize U(y v . . ., y n ) among all sets of variables which satisfy 
the feasibility conditions (2). 

It is to be noted that we have assumed the absence of external 
economies and diseconomies (as between processes), since the 
inputs and outputs g i} do not depend upon the scale of any 
other process than the j th . 

We have presented the production sector of the resource 
allocation model in the form of processes rather than the more 
usual production functions. In this, we follow Koopmans 23 ) 
who argues persuasively that the latter is a derived concept 
which already implies some elements of optimization. The 
formulation in terms of processes is also preferable if the model 
is interpreted as referring to optimization within a large firm. 
However, we generalize Koopmans’ model by admitting non- 
linear functions g iP so that the effects of non-constant returns 
to scale can be studied. 

For future reference, note that the variables y x , . . ., y n , 
x p . . ., x m are necessarily non-negative from their very defini- 
tion. 


s ®) T. C. Koopmans, "Analysis of Production as an Efficient Combination 
of Activities," ch. Ill in Activity Analysis, pp. 33 — 97, especially pp. 33 — 34. 
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II. MATHEMATICAL BACKGROUND 

A. Some Notes on Unconstrained Maxima . 

1. Necessary Conditions. 

The numbers z v . . z 9 form the [unconstrained) maximum of 
the (real valued) function/^, . . z v ) if f(z lf . . ., z v ) f[z x> . . z 9 ) 
for all possible combinations of values of z v . . ., z v . It is per- 
missible to think of the z/s as being restricted to non-negative 
values, if appropriate. 

For an economic example of unconstrained maximization in 
the present model, consider the manager of a single process, 
say the j th , who buys and sells the commodities at prices 
fiv * - ->Pn which are given to him. His profit, then, is 

n 

(3) Xifa) =2Pigii{ x i)- 

i—1 

In a competitive world, the manager seeks to choose x 3 to 
maximize (3). Of course, this is an example where there is only 
one variable. 

We are not primarily interested in unconstrained maxima as 
such, but a brief discussion of some points connected with them 
will serve to illustrate developments in the more complicated 
case of constrained maxima with which we are more concerned. 

It should first be noted that an arbitrary function need not 
have a maximum. For example, in (3), if each of the functions 
giffa) is linear, then so is (recall that the prices p t are 

taken as given numbers, not variables as far as the process 
manager is concerned). If the linear function n s {p s ) has a positive 
slope, then n j can be made as large as desired by making x i 
sufficiently large, so that no finite value can be designated as 
the maximum. Even a non-linear function need not have a 
maximum. 

Second, it should be noted that a function may have more 
than one maximum. An extreme case is a constant, for which 
all points are maxima; thus if the profit function fyfa) is every- 
where*zero, a situation which arises in linear programming, all 
values of x i maximize it. 

Third, we have defined a maximum with respect to all per- 
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missible variations in the variables, sometimes referred to as a 
global maximum. It is frequently useful to use the weaker concept 
of a local maximum, a set of numbers z v . . ., z v such that 
f(z v . . ., z„) ^ /(%, . . z 9 ) for possible values of the z-s in a 
small neighborhood of z v . . z P . A global maximum is clearly 
a local maximum but not, in general, conversely. In this work 
the term, “maximum,” when unqualified, will refer to a global 
maximum. 

If the function / is differentiable and has a local maximum 
and the variables z i are unrestricted as to sign, then, as is well 
known, the partial derivatives 3 fldz { = 0 for all i = 1, . . ., p 
when evaluated at the maximum. We shall use the notation 
f„ t for 3 f/dZf in general and / 2 . for f, t evaluated at z v . . ., z p . 
Then a necessary condition for a local (and hence for a global) 
maximum when the z/s are not restricted as to sign is that 
4 = 0 (*■ = 1 p). 

This last condition is, of course, not sufficient even for a 
local maximum, let alone a global one. The condition also holds 
at minima and indeed at some points which are neither maxima 
nor minima. 

2. Concave Functions and Sufficient Conditions. 

The condition that the derivatives be zero does become a 
sufficient condition for a global maximum if the function 
f(z v . . ., z p ) is restricted to a special class, known as concave 
functions. If there is only one variable, a concave function is one 
in which the slope is never increasing as the variable increases. 
A function of one variable is said to be strictly concave if the 
slope is decreasing; such a function is illustrated in Figure 1. 
A concave function which is not strictly concave differs only in 
that the graph might have linear segments. It is easy to see 
graphically that the point z at which the derivative is zero is 
indeed the maximum. 

In Figure I it can be seen that if we take two points, such as 
z and z', the part of the graph between the two points lies above 
the line segment joining them. This property can serve tfi yield 
definitions of concavity and strict concavity for functions of any 
number of variables. 
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Definition 1 . The (real- valued) function f(z v .... z 9 ) is said 
to be concave if for any two points (z v . . ., z v ) and {z[, . . ., z' v ) 
and any third point (z'jf, . . ., z") such that z” = 6z t + (1 — 6)z' t 
(i = 1, . . ., p), where 0 is a real number between 0 and 1, 
f(z”, . . ., z ;') ^ 0 f(z v . . z v ) + (1 - 6)f(z[, . . z' p ). The func- 
tion f(z v . . z p ) is said to be strictly concave if the strict inequality 
holds in the last statement. 

Note that a linear function is concave but not strictly concave. 

In the case of a single process in a competitive world, where 
the function (3) is to be maximized, then the condition dnjdzj = 0 
is both necessary and sufficient for maximization if the function 
7 i j (Xj) is concave and differentiable. That is, the point of maxi- 
mum profit is that for which the marginal ‘profitability is zero. (If 
the function n^xf) is not concave, then this need not be true; 
in that case, the point where the marginal profitability is zero 
might conceivably be a point of minimum profit.) Suppose 
now that each of the functions g^xf) is concave; then it is easy 
to see that, since the prices pi are presumably non-negative, 
the function n s {Xj) is also concave. Thus the concavity of the 
functions gulxf) insures that the condition of zero marginal 
profitability (price equals marginal cost) is equivalent to profit 
maximization. 
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What is the meaning of the condition that the functions 
are concave? For simplicity, assume that the process has 
only one output, say commodity 1, and that the scale of the 
process is measured by that output so that g l3 (xj) = x s (note 
that this function, being linear, is certainly concave). Since the 
other commodities are inputs, g i3 (x } ) is the negative of the input 
of commodity i for i > 1. Since also g 43 (0) = 0 (for zero output, 
zero inputs suffice), it is easy to see that concavity of the 
function g i3 (aq) means that the input of the i th commodity 
increases at least as rapidly as the output, which is the same 
as non-increasing returns. Strict concavity would imply diminish- 
ing returns, while the requirement of concavity alone permits 
constant returns. 

3. The Gradient Method. 

Now consider the problem of finding an unconstrained maxi- 
mum by a process of successive approximations, starting from 
some given first approximation, say . . ., z®. One possible 
procedure is to vary each coordinate separately in such a way 
as to increase the function f(z v . . ., z p ). Thus, suppose df/dz i > 0 
at the initial point. Then it is reasonable to increase z t somewhat; 
the contrary would be true if dfjdz i < 0. The same reasoning 
applies to each coordinate. Thus a reasonable process would call 
for picking a new point z\, ■ ■ in such a way that, 

(4) z]~Zf has the same sign as f Z( (z $, . . ., z\) for each i— 1, f. 

The same procedure can be applied again starting with . . ., z p ; 
indeed, such repetition is the essence of an iterative procedure. 
Thus, at the step, 

(5) ^ +1 — z*has the same sign as f l% (z\, . . ., z\) for each t=l, 

The method specified by (5) is a method of finite differences; 
mathematically, it is simpler to assume that the process of 
adjustment takes place continuously rather than in the small 
steps implied in (5). To effect this, the finite difference z* +1 —zj 
should be replaced by the corresponding derivative* dzjdt, 
that is, the rate of change of the i th coordinate with respect to 
(computational) time. The requirement in (5) is then replaced 
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by the condition 

(6) dzjdt has the same sign as f Z{ (z v . . z,J. 

Finally, the simplest way to insure that (6) holds is to require 24 ) 

(7) dzjdt = k t ( df/dzj {i = 1, . . j>), 

where k t is a positive constant. Relation (7) defines a system of 
differential equations; the solution of this sytem defines each 
coordinate as a function of time, and this is a description of the 
adjustment process. We shall refer to (7) as the gradient method . 25 ) 
The constant k 2 is the adjustment speed. 

It is easy to see that we can assume = 1 without loss of 
generality simply by changing the units in which z { is meas- 
ured. For let z\ — a^; then dzjdt = a^zjdt, while df/dz^ = 
(I /a i )(df/dz i ), so that (7) becomes 

dzjdt = a\ki ( dfjdz'J ) . 

We can choose a i so that a\k t — 1. In this form, with which we 
shall be mostly concerned, the gradient method becomes 

(8) dzjdt = dfjdZi- 

For this method to be acceptable, it is necessary that the solution 
of (8) converge to the maximum values z v . . ., z v . If we look 
again at Figure 1, it is intuitively obvious that the process (8) 
will indeed converge to z; for if the starting point z° is below z, 
the rule (8) calls for increasing z up to the point z where the 
derivative is zero but not going beyond. In general, we can make 
the following statement: 

(9) If f(z v . . ., z v ) is a strictly concave function, then the gradient 
process (8) converges to the maximum point (z lt . . ., z v ). 

24 ) tzj z i> • • •> z v) and df/dzi have the same meaning. 

25 ) See H. B. Curry, “The Method of Steepest Descent for Non-Linear 
Maximization Problems,” Quarterly of Applied Mathematics, Vol. 2 (1944), 
258 — 61, who discusses finite difference methods; C. B. Tompkins, “Methods 
of Steep Descent,” ch. 18 in E. F. Beckenbach, ed., Modern Mathematics for 
the Engineer (New York: McGraw-Hill, 1956), pp. 448—79. A brief discussion 
is also fqjind in Samuelson, Foundations, pp. 301 — 2. Strictly speaking, (7) is 
only a special case of the gradient method (sometimes called the method of 
steepest ascent), but since it is the only form with which we shall be concerned, 
no confusion will result. 
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(The requirement of strict concavity instead of concavity is not 
basic, but the statement of (9) would have to be more com- 
plicated otherwise.) 


f(z) 



If f(z v . . ., z P ) is not concave, we might have the situation 
of Figure 2; there, if the starting point z° is below z* or between 
z* and z, the process will converge to the local maximum z* 
rather than to the global maximum. 26 

4. A Limiting Form of the Gradient Method 

We can imagine in equation (7) that some of the adjustment 
speeds k t approach infinity. This is equivalent to saying that 
some of the variables, say z lt . . ., z r , are adjusted instantaneously 
to values which maximize / for given values of the remaining 
variables z r+1 , . . ., z v . This may be meaningful in situations 
where the function f(z v . . ., z B ) is of such a simple form with 
respect to z v . . z r that it is computationally practical to find 
the maximum with respect to those variables by some fairly 
direct method. The other variables z r+x , . . z B are varied in 
accordance with equations (7), which, by a suitable choice of 
unit, can be written in the form (8), as has already been ex- 
plained. The adjustment process then has the form, 

(10) z v ...,z r maximize f[z v . . ., z r , z r+1 , . . z v ) for given z r+1> . 

© 

2e ) The gradient process for unconstrained maxima will converge in general 
to some point at which all derivatives are zero. 
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(11) dzjdt = df/dZf, i = r + 1, . . p. 

As the variables z T+1 , . . z p change, the values of z v . . z r 
which maximize f(z v . . ., z p ) will usually change too, 27 ) so that 
all the variables are actually changing in the process. If we as- 
sume that the function f(z 1 , . . ., z v ) is concave in all the variables, 
it is necessarily concave in z lt . . ., z r , so that (10) is equivalent to 

(12) df/dZf = 0, * = 1, . . r. 

The adjustment process defined by (11) and either (10) or (12) 
sounds very reasonable, at least if the equations (12) are com- 
putationally practical to solve. The assumption that in a dynamic 
system some variables adjust slowly while other variables adjust 
virtually immediately to the first set is a not uncommon one in 
economics; we shall see its economic interpretation in a resource 
allocation context later. 

This process has one difficulty which has some implications 
for the structure of the adjustment process in optimal resource 
allocation. It is possible that for some values of z r+1 , . . ., z p , 
the function f(z v . . ., z p ) might not have a maximum with 
respect to z lt . . ., z r ; that is, f(z 1 , . . ., z r , . . ., z p ) might increase 
indefinitely as one or more of the z- s (i = 1, . . ., r) increase 
to infinity. This is true even if the function f(z v . . ., z v ) is 
strictly concave. To be sure, if z T+1 , . . ., z v have the values 28 ) 
z r+1 , . . ., Zj,, then the function f(z v . . ., z p ) has its maximum 
value when z i = z i (i = 1, . . ., r), so the maximization process 
is well defined there. In general, the problem does not arise if 
z t+i> • ■ ■> z v are sufficiently close to the maximizing values 
z r+1 , . . ., z p , but it might arise otherwise. 

We are also implicitly assuming that the maximum, when it 
exists, is unique. 

Subject to these qualifications, however, the limiting form of the 
gradient process defined by (11) and either (10) or its equivalent 
(12) has the same satisfactory convergence properties as those 
of the gradient process (8). Parallel to (9), we can assert, 

37 ) I.e* there is a set of functional relations z l = ■a 1 (.! r+1 , . . . z v ) 

z r = z r( z r+i> • • ■> z s>) such that . . .,z, determined from these relations will 
satisfy (10). 

**) We recall that / is maximized at the point z = (z x , z t , . . i r , z T+l , . . S p ). 
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(13) if f{z v . . ., z 9 ) is strictly concave, then in a region suf- 
ficiently close to the maximum so that equations (12) are 
solvable throughout, the process defined by (11) and (12) 
converges to the maximum point (z lt . . ., z 9 ). 

5. Non-negative Variables. 

The preceding discussion has assumed that the variables 
z v . . ., z 9 are unrestricted in range. However, as noted in part I, 
section C, in our resource allocation problem, we are primarily 
interested in the case where all variables are required to be non- 
negative. Consider, for example, the maximization of (3). If 
the maximizing value of x , is positive, then, indeed, the derivative 
has to be zero. Suppose however, nfa) has its maximum at 
zero; that is, operating the process at any positive level involves 
a smaller profit (or greater loss) than not operating at all. 
This implies that at zero, the marginal profitability cannot be 
positive, but it might be negative, as illustrated in Figure 3. 


f(z) 



The same considerations hold when there are several variables. 
We may thus conclude, 

(14) f Z( <; 0 for i = 1, . . ., p; if J Z( < 0 for some i, then z t — 0. 

Relations (14) are in general only necessary conditiofis for a 
local maximum in general. If the strict inequality never holds, 
then, as we have seen in paragraph 1 of this section, the point 
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may be a local minimum. Even if the strict inequality does hold 
for some components but not for others, a point satisfying (14) 
may be neither a maximum nor a minimum. Even if it is a 
local maximum, it need not, of course, be a global one. 

However, as in paragraph 2 of this section, relations (14) are 
a sufficient condition for a global maximum if the function 
f(z i, . . ., z p ) is concave, as illustrated for one variable in Figures 
1 and 3. 

The gradient method described in (8) also requires some 
modification when we deal with non-negative variables. If, at 
some point in the process, one of the variables z i is zero and if 
at the same point 3 ] fjdz i < 0, the unmodified gradient method 
(8) would require z i to decrease further, that is become negative, 
which would make no sense if the variable is intrinsically non- 
negative. This would happen, for example, in Figure 3, if the 
process has reached the point z = 0. To prevent this, we must 
add to (8) the rule that in this case the variable remains at zero, 
so that its value does not change, i.e., dzjdt = 0. 

The gradient method then takes the following form: 


(15) 


| 0 if z i = 0 and df/dz i < 0, 
l df/dZi otherwise. 


It is easy to see that in order for the process (15) to stop, that is, 
for dzjdt to equal zero for all i, it is necessary and sufficient 
that the conditions (14) hold. In other words, the maximum 
z v . . z p is the equilibrium point of the system of differential 
equations (15). 

In the limiting form of the gradient method, (14) is applicable 
to the variables z v . . ., z ri in view of (10) applied to the case of 
non-negative variables. Similarly, for variables z ri . v . . ., z v , (11) 
is replaced by (15). The method becomes, then, 

(16) df/dZi ^ 0, with 3 f/dz t < 0 only if z t = 0, for i = 1 , . . ., r\ 


(17) 



0 if z { = 0 and dfjdz i < 0, 
dfjdz t otherwise, 


for i = r + 1, . . p. 

In both forms of the gradient method, convergence is assured 
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under the same circumstances as in the case of variables un- 
restricted as to sign. 

Finally, for future reference, note that minimization calls for 
the same rules as maximization, with appropriate changes of 
sign. Thus, instead of (14), we have for the minimum z, 

(18) l t ^ 0 for i = 1, . . ., p\ if / 2j > 0 for some i, then z t = 0. 

Again, (18) is a necessary condition for a minimum in general 
and a sufficient condition if f(z v . . ., z v ) is convex. 29 ) The gradient 
method (15) applied to a minimum becomes. 


(19) 



0 if z { = 0 and df/dz, > 0, 
—dfjdZi otherwise. 


B. Constrained Maxima and Saddle-points. 

The numbers z v . . ., z p form the constrained maximum of 
f(z v . . ., z„) subject to the constraints 


(20) &(*i, . . z 9 ) ^ 0 s), 

if f(z v . . ., z p ) ^ f(z v . . ., z p ) for all combinations of values of 
z„ which satisfy (20). Again, we may and will restrict 
ourselves to non-negative values. The resource allocation problem 
of part I, section C deals with such a constrained maximum; 
here the z’s are interpreted to include the x’s and the y’s, f (z v . .., 2 j 
is identified with U(y v . . ., y n ) (the fact that U does not depend 
upon all the z’s does not create any difficulty), and 


(21) gi (%,..., z p ) = 


1 gufri) + - Vi 

2=1 
m 

IsM + 

'i - 1 


(i — 1, • • n) 

(i = n + 1, . . ., s). 


When the constraints (20) are in the form of equalities rather 
than inequalities and the variables are not restricted to be non- 
negative, the classical method of Lagrange multipliers supplies 
a necessary condition for a constrained maximum. That is, 
in order that z v . . z„ maximize f(z v . . z p ) subject to 


29 ) A convex function is one whose negative is concave; that is, / is convex 
if —/is concave. 
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gj(z v ...,*„) = 0 (/ = 1, . . ., s), it is necessary that there exist 
p s such that 

s 

^ 22 ) = ® (* = • • •» P)> 

ii(* i. • • •> **) = 0 (/ =!»•••. s). 

Here, g, >2< = dg,/dz { . This condition has been much used in 
economics. 30 ) Thus, the theory of consumer’s behavior makes 
use of a special case of (22), where z/s are the amounts of 
different commodities purchased, f(z v ..., z p ) is the utility 
derived from a given bundle of commodities, and there is just 
one constraint, the budgetary constraint, which can be written 
in the form 

M-2qiZi = 0, 
i= 1 

where q t is the price of commodity i and M is total income. In 
this case, g hz% = — q % \ the Lagrange multiplier is interpreted 
as the marginal utility of income. 

Another application of (22), this time using more than one 
constraint, occurs in Lange's development of the theory of welfare 
economics. 31 ) 

Equations (22) can be rewritten in an interesting way if we 
introduce the Lagrangian L, defined as, 

(23) L(z 1( . . ., z„; . . ., p n ) =f(z 1 , . . ., z 9 ) + 2Piii( z v ■ • •» * „)• 

3=1 

It is easy to see that, 

(24) dL/dZi = f Z{ + 2 p 3 g jtti , dLjdpt = gfa . . ., z„). 

j—1 

In view of (24), the condition (22) which is necessary for a 
maximum is that, 

(25) L Zi = 0, L„ t = 0, i = 1, . . ., p, j = 1, . . ., s. 

This form is rather suggestive. If we take, for example, the p/s 

80 ) SeeP R. G. D. Allen, Mathematical Analysis for Economists (London: 
Macmillan, 1938) pp. 364 — 83; Samuelson, Foundations, pp. 262 — 64. 

81 ) See O. Lange, “The Foundations of Welfare Economics,” Econometnca, 
X (1942), 215—28. 
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as given numbers, with p } = f> } for each j, then (25) shows that 
the z/s satisfy at least a necessary condition for a maximum or 
minimum . Before following up this hint, let us reconsider the 
constrained maximum problem in the form first proposed. 

We want to change the problem which leads to (25) in two 
ways: the z/s are to be restricted to be non-negative, and the 
constraints are to be inequalities, as given in (20). Our presenta- 
tion here is based on the important study by Harold W. Kuhn 
and Albert W. Tucker. 32 ) Analogous to (14), it can be shown 
that the requirement that the z/s be non-negative requires 
modifying the first half of (25) to, 

(26) L Z{ <> 0 (* = 1, . . ., p); if l„ t < 0, then S t = 0. 

Considering the constraints as inequalities requires two 
modifications. First, suppose that, for some j, gj(z v . . S p ) > 0. 
Then this particular constraint is ineffective, in that the choices 
of the z’s could have been slightly modified without violating 
the constraints, but it was not found profitable to do so. Since 
the solution to the constrained maximization problem should 
be the same if an ineffective constraint is dropped from the 
problem completely, we would expect (correctly) that, 

(27) if &(*!, . . ., z p ) > 0, then § s = 0. 

The second point is that the p/s must be non-negative. The 
point of this can be seen by considering a single restraint and 
giving an economic interpretation to the problem. Let z v . . ., z p 
be some economic variables (perhaps outputs), f(z v . . ., z p ) 
the return from them in utility or money, and g(z lf . . z p ) 
the excess supply of some resource, that is, the initial amount 
available of that resource less the amount needed for the choice 
of variables z v . . ., z p . The condition that g(z v . . ., z p ) ^ O' 
simply amounts to saying that more cannot be used of the 
resource than is available. Assume, for simplicity, that the f/s 
are all positive, and that the constraint is effective; then, 

( 28 ) fz 1 + Hz, = 0, g(z v . . z p ) = 0. 

S2 ) See H. W. Kuhn and A. W. Tucker, "Nonlinear Programming,” in J. 
Neyman, ed., Proceedings of the Second Berkeley Symposium on Mathematical 
Statistics and Probability (Berkeley and Los Angeles: University of California 
Press (1952), pp. 481-92. 
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Consider any movement of the z/s which will increase the return 
f(z v . . ., z p ) as compared with the return when the variables 
take on the values z v . . ., z p . For example, suppose a small 
increase in z x will increase f(z v . . ., z p ). It must be, then, that 
such an increase is ruled out as being infeasible in that it violates 
the constraint g(z x , . . z P ) 0, for if it were feasible, the 
z ’ s could not be the constrained maximum. That is, an increase 
in z 1 will increase f(z v . . ., z p ) but must decrease g(z v . . ., z„) 
below zero. This requires that, 

(29) f H > 0, g Zi < 0; 

in view of the first part of (28), we must have p > 0. 

In economic terms, we can imagine that instead of explicitly 
restraining an economic unit to undertake only feasible policies, 
we permit any decisions (whether feasible or not) on the z/s 
but require it to pay a price p on the amount of the limited 
resource used. We also let it use any part of the initial supply 
of the resource or sell any part at the price p. Then the unit’s 
net return can be thought of as f(z v . . ., z p ) + p g(z v . . ., z„), 
which is the Lagrangian. Then the above asserts that, by 
choosing p to have the non-negative value p, the unit will be 
constrained not to violate the feasibility condition. 

If we combine (26), (27), and the preceding discussion, with 
the aid of (20) and the second half of (24), we have the following 
theorem, due to Kuhn and Tucker: 

Theorem 1. A necessary condition that z lt . . ., z v be a 
constrained local maximum of f(z v . . ., zj subject to the 
constraints gj (z x , ...,*,) ^ 0 (j = 1 , . . ., s), with all variables 
non-negative, is that, 

(a) z f ^ 0; L Zj ^ 0; if L Z( < 0, then z ( = 0 (*=!,.. ., p); 

(b) p } ^ 0; L Pf ^ 0; if L Pj > 0, then § t = 0 (j = 1 , . . ., s). 33 ) 

83 ) Strictly speaking, the above theorem is true only if the constraint functions 
gf(z lt . . z v ) satisfy an additional condition, such as that referred to by Kuhn 
and Tucker as the Constraint Qualification (op. cit., pp. 483 — 4). A simple 
condition of this type (used by M. Slater is an unpublished paper) is that there 
exist some z v . . g 9 such that g j (z 1 , . . z v ) > 0 for ally; in economic terms, 
that it be possible to choose the economic variables so that there is an excess 
supply of all resources. 
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Remark. It is to be stressed that Theorem 1 yields only a 
necessary condition for a local maximum. The point is precisely 
analogous to that which has already been made for unconstrained 
maxima. Except for the strict inequalities in condition (1), a 
constrained local minimum would satisfy Theorem 1; even with 
the inequalities, a point satisfying conditions (a) and (b) might 
be neither a maximum nor a minimum locally. Further, even if 
we have a local maximum satisfying the conditions of Theorem 1, 
there is, in general, no assurance that it will be a global maximum. 

Additional conditions on the functions involved are needed 
to insure that the conditions of Theorem 1 suffice for a global 
maximum. In view of the corresponding results for unconstrained 
maxima, it is not surprising to find that conditions (a) and (b) 
are sufficient for a global maximum if the functions f(z v . . z p ) 
and g j (z 1 , . . ., z v ) ( j = 1 , . . ., s) are all concave functions. 

Closely related to this problem is that suggested by the 
economic interpretation given in the discussion leading up to 
Theorem 1, namely whether or not the economic unit can be 
thought of as choosing z v . . z v so as to maximize the Lagran- 
gian when the p/s are set equal to p v . . ., p s . Condition (a) 
shows that a necessary condition for a maximum must be satisfied 
(recall condition (14)). It is certainly not always true that the 
z/s maximize the Lagrangian given the p/s; in economic terms, 
it is not always true that the optimal allocation of resources will 
be achieved by maximizing profits even when the prices of 
resources are properly set. 34 ) But there is an important class of 
cases when in fact we can think of maximizing the Lagrangian 
with respect to the z/s. Indeed, we know from earlier discussion 
(section A.2, this part) that condition (a) of Theorem 1 is 
sufficient for a maximum if the Lagrangian is a concave function 
of the z/s (taking the p/s as given at . . ., p s ). In turn, we 
can say that this condition is satisfied if f(z v . . ., z p ) and 
g](Zi, ■ ■ ., z„) (j = 1, . . ., s) are all concave functions, for then 
L(z v . . z p ; p v . . ., p s ) is a combination of concave functions 
with non-negative coefficients, and any such combination is 
again concave. 

34 ) For an emphatic statement of this viewpoint, see Samuelson, Foundations, 
pp. 230—31, 234—35. 
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(30) If the functions f(z v . . z 9 ) and gi(z v . . ., z p ) are concave, 

then the maximum of L(z v . . z p \ p lt . . ., p n ) is achieved 
by setting z 4 = z t (i = 1, . . p). 

In view of the parallelism between conditions (a) and (b) 
of Theorem 1, it is natural to ask if the p / s minimize the Lagran- 
gian given that z € == z { for all i. Here, this is clearly true, because 
the Lagrangian is a linear function of the p/s and all linear 
functions are convex (as well as concave), so the result follows 
from (18). 

(31) The minimum of L(z v . . z p ; p v . . p s ) is achieved by 
setting p s = pt (/ = 1, . . ., s). 

Statements (30) and (31) suggest the convenience of the 
following definition as applied to any function depending upon 
two sets of variables (which we will here regard as non-negative). 

Definition 2. The function 0(z ly . . ., z p ; p v . . ., p 8 ) is 
said to have a saddle-point at (z v . . ., z p ; p ±i . . ., p 8 ) if 
0{z ly . . ,,z p ; p 8 ) has its maximum in z v . . ., z v at 

z v . . ., z p and &(z v . . ., z v ; p lf . . ., p s ) has it minimum in 
P\y • • *J Ps ^ Pl> * * * iPn * 

The concept of a saddle-point was used by von Neumann 
and Morgenstern in connection with the theory of zero-sum 
two-person games. 35 ) The definition just given is not the most 
general possible, but it is sufficient for the present purposes. 
It is useful to think of a game in which player I chooses p real 
numbers, z v . . z 9 , player II chooses s real numbers, p v . . ., p 8i 

the two choices being made independently, and the amount paid 
by player II to player I is given by the function 0(z v . . ., z v \ 
Pi> • • •> Ps)- negative value means that player I pays 
player II.) Clearly player I wishes to maximize the function 
0{z ly . . ., z p ; p lt . . p s ) with respect to the variables z v . . ., z p 
at his control for any given choice of plaj^er IFs variables, while 
player II wishes to minimize with respect to his variables. 

The discussion can be summarized in the following theorem 
due to Kuhn and Tucker: 

35 ) J. von Neumann and O. Morgenstern, Theory of Games and Economic 
Behavior (1st ed.; Princeton, New Jersey: Princeton University Press, 1944), 
p. 95. 
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Theorem 2. If f{z x , . . z„) and z 9 ) [j = 1, . . n) 

are concave functions of the non-negative variables z x , . . z v , 
then a necessary and sufficient condition that z x , . . z v 
ma xim ize f(z x , . . z„) subject to the constraints g, (z v ...,z v )^ 0 
(j = 1, . . s) is that there exist numbers ft x , •■■>§$ such 
that the Lagrangian, 

S 

L(Z V . • p s ) = f(% i) • • *i ^aj) -f- ^ Pjgj{% l> • • •> ^j))» 

i=l 

has (z v . . z p ; p l9 . . p s ) as a saddle-point with all variables 
being regarded as non-negative. 36,37 ) 

Theorem 2 has considerable economic interest from a static 
point of view, i.e., as a characterization of the optimal resource 
allocation. However, we are here more interested in its im- 
plications for a process of successive approximations. In the 
saddle-point problem, the variables are unconstrained (except 
that they must be non-negative); hence, it appears possible 
that a gradient method will make some sense. The equivalence 
of the saddle-point and constrained maximum problems can then 
be used to apply the resulting process to the latter problem. 

C. Local Saddle-point Conditions for Constrained Maxima. 

Before discussing the gradient method for saddle-points, we 
will present some local theorems analogous to Theorem 2. If 
the concavity conditions are not satisfied, we cannot expect 
to be able to state simple conditions which could characterize a 
global maximum or distinguish it from alocalmaximum. However, 
it would at least be worthwhile to find necessary and sufficient 
conditions for a local maximum. Theorem 1 does not satisfy this 
requirement: it only states necessary conditions. 

38 ) Again, the above theorem is true only if the constraint qualification holds; 
see footnote 33. 

37 ) The equivalence relation between saddle-points and constrained maxima 
was previously studied in the case where the functions f{z lt . . z p ), g^z^ . . ., z P ) 
are linear, that is, in the case of linear programming, by D. Gale, H. W. Kuhn 
and A. W. Tucker, “Linear Programming and the Theory of Games,” ch. XIX 
in Activity Analysis, pp. 317 — 29, and G. B. Dantzig, “A Proof of the Equivalence 
of the Programming Problem and the Game Problem,” ch. XX, Hid., pp. 
330 — 35. The Kuhn-Tucker theorem given in the text presents a somewhat 
different form of the relation between the two types of extrema as well as an 
extension to nonlinear maximand and constraints. 
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Theorem 2 suggests that a local constrained maximum might 
be characterized by the local saddle-points of a Lagrangian. 
This is not true if we take the Lagrangian in the form in which 
it has been used, but it is true for modifications of the original 
Lagrangian, of which we present two. First, we will show by 
an example that a local constrained maximum is not in general 
a local saddle-point of the Lagrangian. Let the maximand be 
f(z v J8f a ) = z\ + z\ and the constraint g(z v z 2 ) = 1 — z x — z 2 ^ 0. 
It is easy to see that the maximum, for non-negative variables, 
is attained at two points, (0, 1) and (1, 0). Let us consider the 
second. The Lagrangian is, 

(32) L(z v z 2 ; p) = z? + z\ + p[l — z 1 — z t ). 

By Theorem 1, at the maximum, L H = 0 (since z x = 1 > 0), 
so that p = 2. If the optimal solution were a local saddle-point, 
the point (1,0) would be a local maximum of, 

(33) L(z v z 2 ;p) = z\ + z\ + 2(1— *!—*„) = fo— I) 2 + (*,— l) 2 . 

But clearly any change in z x would increase (33), so that the point 
cannot be a local maximum of (33) and therefore not a local 
saddle-point. 

Our program, then, is to search for a modification of the 
Lagrangian so that a constrained maximum will correspond to 
a saddle-point. First we remark that no difficulty occurs with 
respect to the minimization part of Definition 2, as we have 
already seen in equation (31); that is, the p/s minimize the 
Lagrangian L(z v . . ., z p ; p v . . ., p s ) with respect to the p/s 
regardless of assumptions about the functions f(z v . . ., z p ), 
g,( 2 x , . . ., zj. This is because the Lagrangian is linear in the 
p/s. Our concern therefore is to modify the Lagrangian so that 
it be locally concave in the z/s when the p/s are set equal to 
their equilibrium values. More precisely, we shall seek to make 
the Lagr an gian strictly concave locally in the z/s; the strict 
concavity will be important in the applications of the gradient 
method to be discussed in the next section. We will refer to 
Lagraflgians modified so as to have the desired concavity 
properties as concavified Lagrangians. 

Our first method for concavifying Lagrangian is based on 
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the remark that the constraints in a given maximization problem 
can be described in different ways. This is analogous to the well- 
known proposition in consumers’ demand theory that monotone 
transformations of the utility function leave the demands 
unchanged, the point being that the set of variables which 
ma xi miz es a function also maximizes any increasing function 
of it. In the same way, it is possible to transform the constraints 
without changing the set of values of the variables which satisfy 
them. Let p t {u 3 ) be any (real-valued) function of one variable 
which preserves signs, that is, pj(u } ) is positive, negative or zero 
according as % is positive, negative, or zero, respectively. Let 
one of the constraints in a maximization problem be that 
g s (z v 0. Define a new function of the variables 

• • •) 

(34) g* (z v . . zJ Pj ) = pfafo . . ., *„)]. 

In other words, for any given set of values of the variables z {> 
the corresponding value of g* is obtained by first computing 
gj(z v . . ., z v ), and then computing the value of />,-(%) when 
Uj is set equal to the value of g i (z 1 , . . ., z p ). Since the function 
Pi( u i) preserves signs, it follows from (34) that 

(35) S*(h> • • •> z Jpi) ^ 0 if and only if g j (z 1 , . . z p ) ^ 0. 

The constraints then are unchanged by the transformation. 
For any sign-preserving functions P]{u s ), any values of the 
variables z lt . . ., z v which satisfy the original constraints 
gj(zi, . . ., z p ) ^ 0 also satisfy the transformed constraints 
g*(z v . . ., zJpj) ^ 0, and conversely therefore in particular 
the values z v . . ., z„ which maximize f(z 1 , . . ., z v ) tmder the- 
original constraints are the same as those which maximize it 
under the transformed constraints. For the transformed problem,, 
the Lagrangian becomes 

(36) L*(z v . . ., z p ; p u . . ., pjp) 

S 

— fih. • • ■> z v) 2 PiS*( z i> • • •> z vIps) 

The necessary conditions of Theorem 1 must hold for all the- 
Lagrangians formed by transforming the constraints. 
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The problem of concavification thus becomes one of choosing 
among the Lagrangians with transformed constraints a class of 
Lagrangians for which the concavity property holds. We want 
however a class which can be defined rather broadly in advance 
of solving the problem; there is no point in merely showing the 
existence of a set of functions p 3 (%) for which the Lagrangian 

(36) is locally strictly concave in the s when p i = p 3 for all 
j if the determination of that set is as hard as the original 
maximization problem. Fortunately, it is possible to specify a 
whole class of transforming functions p 3 (u 3 ) which depend only 
on a very general knowledge of the functions involved. 

We will present the result in terms of a very specific type of 
transforming function. Note that since p 3 (Uj) is negative for 
u 3 negative and positive for u s positive, it must be increasing 
at u i = 0. Let us consider the class of functions, 

(37) Pj { Uj ) = 1 — (1 — , 

with rji an even integer which has this property. The following 
theorem can be demonstrated: 

Theorem 3. Under certain regularity conditions, for all even 
rj/s sufficiently large, a necessary and sufficient condition that 
z v . . ., z p be a local maximum of f(z lt . . .,z p ) subject to the 
constraints g j (z 1> ^ 0 is that there exist numbers 

pi , . . .,p s such that the Lagrangian, 

■^■'*(^’17 • * *> ** P , Pit . . pjrj) 

= f{* **) + 2 Pi (l-E 1 - Sii.h> ■ • •> Zv)Y +7,, )> 

has (z lt . . ., Zj,; p v . . ., p s ) as a local saddle-point, all variables 
being considered as non-negative. In particular, the function 
L*{z v ..., z p ; pit ..., pjrj) is locally strictly concave in 

Z p , 38 ) 

Notice that the transformation can be chosen from among a 
wide class since all ?y/s sufficiently large will suffice. The particu- 

8R ) For a demonstration and more complete spelling out of the regularity 
conditio^, see K. Arrow and L. Hurwicz, “Reduction of Constrained Maxima 
to Saddle-point Problems,’* in J. Neyman, ed., Proceedings of the Third Berkeley 
Symposium on Mathematical Statistics and Probability (Berkeley and Los 
Angeles: University of California Press, 1957), V, 1 — 20. 
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lar choice of transformation function (37) is not essential; the 
basic condition is that the ratio, 

ft(0)/ft(0), 

be sufficiently large. Another example is p 5 {u 0 ) = 1 — 

We will refer to the Lagrangian of Theorem 3 as the concavified 
Lagrangian with transformed constraints. It is in fact a Lagrangian 
in the ordinary sense applied to a problem which is identical 
with the original but has the constraints restated. 

Before leaving this method of concavification, we note that 
it can be applied to the situation where the maximand f(z v . . z p ) 
and the constraints g g (z v . . z 9 ) are linear. In this case, Theo- 

rem 2 is applicable, but the Lagrangian is linear in the z/s, 
not strictly concave. The latter property, as we have remarked, 
is desirable in applications of the gradient method to be discussed 
in the next section. The class of transformation functions which 
will insure something close to strict concavity of the Lagranian 
is very wide indeed, including all strictly increasing strictly 
concave functions. An application to the resource allocation 
problem in the linear ease will be made in section G, part IV. 

We now proceed to an alternative method of achieving a 
concave Lagrangian, based on a lemma due to Debreu. 39 ) In 
this case, the Lagrangian is modified by subtracting a quadratic 
function of the constraints; we will therefore refer to it as the 
concavified Lagrangian with quadratic modification. First, con- 
sider the case in which the constraints are equalities rather than 
inequalities, that is, the problem is to maximize f(z v . . ., z p ) 
subject to the constraints gj(z v . . z p ) = 0. Then consider the 
expression, 

(38) Lt(z v . . ., z p ; p x , . . ., pjk) 

8 8 

= f{*v • • •> *,) + 2P)gi(h> • • •> *,) -A 2 • • •> z .)]*. 

5=1 5=1 

which differs from the usual Lagrangian only in its last term. 
It is easy to see that the first-order conditions for a constrained 
maximum are the same for the Lagrangian with quadratic 

39 ) G. Debreu, “Definite and Semidefinite Quadratic Forms/’ Econometrica, 
Vol. 20 (1952), 296. 
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modification as for the usual Lagrangian. But it can also be 
shown, that, 40 ) 

(39) U(z l9 . . ., z p ; p t , . . -,p s /X) is locally strictly concave 
in x lt . . ., z p for all X sufficiently large. 

The case of constraints which are inequalities can now be 
covered by introducing new non-negative variables w lt . . ., w Si 
and observing that the condition g t (z l9 ...,**) 2g 0 is equivalent 
to the condition g 3 {z v . . ., z p ) — w s = 0. The constraints now 
being written in the form of equalities, we can apply (39), 
provided it is observed that the set of variables with respect to 
which maximization is carried out has been enlarged by the 
addition of the w/ s. 

Theorem 4. Under certain regularity conditions, for all X 
sufficiently large, a necessary and sufficient condition that 
z lf . . ., z 9 be a local maximum of f(z lf . . ., z p ) subject to the 
constraints g s (z v ...,*„) Sg 0 if there exist p v . . .,p 8 and 
w l9 . . w 8 (these numbers are the same for all X ), such that 
the Lagrangian with quadratic modification, 

£ f (*i, • • * f * 9 , U>1> • • * , Ws', Pv • • Pal*) 

a s 

= L(z 1 , . . z„; p t ) — 2 f,w t - A x 0 ) - wtf, 

3=1 3—1 

have (z lt . . ., z v , w v . . ., w s ; p v . . p s ) as a local saddle- 
point, all variables being considered as non-negative. In 
particular the function L^(z v . . ., z v , w v . . ., w s \ p v . . ., pJX) 
is locally strictly concave in z v . . ., z p , w v . . ., w 8 . 

Remark. When the constraints are equalities, the equilibrium 
values of the Lagrange multipliers p i may be positive or negative, 
and the condition g 3 {p lf . . ., p s ) = 0 is the same as the condition 
that L Vj = 0. Since the Lagrangian is linear in the p/s, this 
means that for equilibrium values of the z t ’ s, it is a constant 
with respect to the p/s. In particular, one can speak of it as 
taking a minimum with respect to the p/s when p i = p 3 ; here 

40 ) See K. Arrow and R. M. Solow, “The Gradient Method for Constrained 
Maxima Under Weakened Conditions/’ ch. 10 in IC. Arrow, L. Hurwicz, and 
H. Uzawa, Studies in Linear and Nonlinear Programming (Stanford, California: 
Stanford University Press, 1958), especially section 4. The result is essentially 
an extension of Debreu’s lemma to cover the case of non-negative variables. 
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the range of variation of the fi/s is unrestricted as to sign. 
The same statements hold for the Lagrangian with quadratic 
modification, since the fi/s do not enter the additional quadratic 
term. It follows that in the saddle-point statement of Theorem 4, 
one could properly say that the fi/s should be unrestricted as to 
sign, instead of being non-negative. However, it can be proved 
that when the constraints are introduced in the special form 
used there, with a slack variable w t to transform inequalities 
to equalities, a saddle-point with respect to non-negative values 
of the fi/s is also a saddle-point with respect to unrestricted 
variations. 


D. The Gradient Method for S addle- foints. 


Let us imagine that a game with a saddle-point is played 
frequently, in fact continuously. At any given trial, player I 
knows what player II has chosen the previous time. Suppose 
player I assumes that player II will make the same choice of 
his variable fi v . . ., fi a on the given trial. Then from his point 
of view, the problem is one of unconstrained maximization. He 
may then be thought of as starting a gradient process to achieve 
this maximum, so that this behavior is described by applying 
(15) to the function &(z v . . ., z v \ fi v . . ., fi s ), with the variables 
fix, . . ., fi s regarded as given. Thus, 


(40) 


fO if z t = 0 and 3^/3^ < 0, 
(30/3 z t otherwise. 


Actually, however, the fi/s are also varying. If player II is 
thinking along the same lines as player I, he will seek to minimize 
the payoff function with respect to fi v . . ., fi s , taking z v . . ., z 9 
as given. Equation (19) is applicable here, so that, 


(41) 



0 if fi } = 0 and 3#/3 fi, > 0, 
—d&fdfij otherwise. 


We wish to state conditions for the convergence of the gradient 
method defined by equations (40) and (41). We cannot simply 
apply the earlier statements about the convergence of the gradient 
process for an unconstrained maximum. Indeed, if 0(z v . . ., z„‘, 
fi x , . . ., fi a ) is strictly concave in the variables z v . . ., z v , taking 
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p x , . . ., fi s as given, then the process (40) would converge if the 
p/ s remained constant. But in fact the latter variables are 
simultaneously changing as a result of (41). We may think of 
an anti-aircraft gun shooting at an airplane. After each shot, 
the gunner may move the gun so as to correct for the observed 
error, but meanwhile the airplane is moving away from its 
previous position and in such a way as to make the error as big 
as possible. Despite the complexity of the problem, it is possible 
to show the following result, 

(42) if 0(z v . . ., z 9 ; p v . . p s ) is strictly concave in z v . . ., z v 
for each set of values of p v . . p s and convex in p v . . ., p s 
for each set of values of z v . . ., z pi then the gradient 
process described by equations (40) and (41) converges 
to a saddle-point (z v . . ., p v . . .,£,) of 0(z lf . . ., z 9 ; 
fiv-.P,)-* 1 '* 2 ) 

That somewhat stronger conditions than the simple concavity 
assumptions of Theorem 2 are needed for convergence of the 
gradient method can be seen from the following example. 43 )* 
Suppose there is just one z and one p y and, 

0(z; p) == z + p — pz. 

It is easy to verify that the unique saddle-point is z = 1, p = L. 
If we disregard for the moment the non-negativity conditions,, 
the gradient method becomes, 

41 ) The assumptions of (42) insure that there is a saddle-point and that 
the values must be uniquely defined; it is possible however to have more than 
one set of p-values. 

42 ) The fact that the £- values converge to the z- values of a saddle-point has been 
proved with local assumptions and conclusions by the authors; see K. J. Arrow 
and L. Hiirwicz, “The Gradient Method for Concave Programming I: Local 
Results,” ch. 6 in Arrow, Hurwicz, and Uzawa, op. cit. The corresponding 
theorem in the large was established by H. Uzawa, “The Gradient Method for 
Concave Programming XI: Global Results,” ch. 7, ibid. That the ^-values also* 
converge is shown in K. J. Arrow and L. Hurwicz, “The Gradient Method for 
Concave Programming III: Further Global Results with Applications to* 
Resource Allocation,” ch. 8, ibid, referred to below as, “Gradient Method 
for Resource Allocation,” see section 1. For an earlier exposition of the gradient 
method as applied to constrained maxima and to saddle-points, see K. J. 
Arrow and L. Hurwicz, “Gradient Methods for Constrained Maxima,” Opera- 
tions Research, Vol. 5 (1957), 258 — 65. 

4S ) The non-convergence of the gradient method in the linear case was. 
observed by Samuelson; see footnote 22. 
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dzjdt = 1 — p> dp/dt — z — 1. 

The solution of this system of differential equations is, 

z = 1 + (z 0 — 1) cos t + (1 — Pq) sin t, 

£ = 1 + (pQ — 1) cos t + (z 0 — 1) sin t t x 

where ( z 0) p Q ) is the initial point of the adjustment path. It is 
easy to see that if z 0 and p$ are sufficiently close to 1, the solution 
never becomes negative, so that the above is in fact the solution 
to the gradient method of (40) and (41) with non-negativity 
conditions satisfied. Also it is clear that the solution is a periodic 
function and so never converges to the saddle-point. Instead it 
cycles endlessly, neither diverging explosively not converging. 44 ) 

E. The Gradient Method for Constrained Maxima . 

We are now in a position to combine the results of sections B 
and D. In section B, it was stated that a constrained maximum 
problem was equivalent to a suitable saddle-point problem. In 
section D, it was remarked that, under certain condition, a 
saddle-point could be determined by a gradient process. We have 
merely to apply (42) to the Lagrangian of Theorem 2. 

First, when are the hypotheses of (42) satisfied by the La- 
grangian? The Lagrangian L(z v . . z v \ p v . . ., p $ ) is linear in 
pi> - • -,p s for any given set of z’s and hence certainly convex. 
Suppose that f(z v . . ., z 9 ) is strictly concave and the g/s all 
concave. Then for any given set of non-negative p/ s, the La- 
grangian is a sum of concave functions, one of which is strictly 
concave; hence the Lagrangian is strictly concave. Thus, the 
Lagrangian satisfies the hypotheses of (42) if f(z v . . ., z 9 ) is 
strictly concave and g s {z l9 . . ., z 9 ) is concave for each j. 

To describe the gradient method, we must substitute the La- 
grangian L{z 1 , . . p v . . p s ) for 0{z v . . z P ; p v . . p 8 ) 

n (40) and (41). We can then assert, 

Theorem 5. If f(z v . . z v ) is strictly concave and 
Zj,) is concave for each j, then the gradient process 

defined by, 

44 ) This fact was one of the reasons for interest in modified Lagrangians 
where convergence, rather than endless cycling, could be obtained. 
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(^3) dzjdt = 

(44) dpjfdt = 


0 if *t = 0 and f + lPiSi,z ( < 0 

a 1 

fzi + HPjgi.zt Otherwise, 

;'=i 


0 if Pi = 0 and g 3 (^, . . *,) > 0, 
— &(%, . . z,) otherwise. 


converges to a limit (z v . . ., z a ; p v . . p s ), where 
(z v . . z p ) is the constrained maximum of f(z v . . z v ) 
subject to the restraints g } (z v . . z v ) ^ 0, where the variables 
Zj are non-negative. 

The gradient method of Theorem 5 can be given an economic 
interpretation along the lines used in section B. If /(%,.. ., z v ) 
is considered to be the utility derived from setting some variables 
at levels z ]t . . ., z v , and g^z-y, . . z v ) is the excess supply of 
resource /, then p s can be regarded as the price of resource j. 
Since g UH is the increase in excess supply (decrease in excess 
demand) due to a unit increase in z it we can interpret, 

n 

j=l 


as the marginal cost attributable to a unit increase in Hence, 
the expression, 

n 

fz t "f" 2 PiSo,z t > 

j=l 


can be interpreted as the difference between marginal utility 
and marginal cost attributable to a unit increase in z it Hence 
(43) is an instruction to increase z t if the marginal utility exceeds 
the marginal cost and decrease it otherwise, with the proviso 
that the activity level z { cannot be decreased below zero. Equa- 
tion (44) has even a simpler interpretation, to increase price if 
the excess supply is negative (i.e., if demand exceeds supply) 
and decrease it otherwise, but not below zero. The theorem 
then asserts that the process just described will eventually 
converge to the activity levels which yield the highest utility 
subject to the conditions that demand never exceed supply 
for any commodity. The process will also define a corresponding 
set of equilibrium prices. 
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Theorem 5 is not in present form applicable to the resource 
allocation model of part I, section C even under the strongest 
reasonable concavity assumptions. Somewhat stronger theorems 
are presented in part III, section D. 

The proposition contained in (42) can be applied locally to 
the concavified Lagrangians introduced in section C. Theorems 3 
and 4 then yield the following corresponding results. 

Theorem 6. If z v . ... z v is a local maximum of f(z v . . ., z„) 
subject to the constraints gf(z lt . . ., z„) ^ 0 and certain 

regularity conditions are satisfied, then the gradient method 
applied to the concavified Lagrangian with transformed 
constraints L*(z v . . ., z v ) p v . . ., pjrj) with rj / s sufficiently 
large, converges to a saddle-point (z v . . ., z p ; § lt . . ., p s ) 
if the initial approximation is sufficiently close. 

Theorem 7. If z v . . ., z v is a local maximum of f(z v . . ., z v ) 
subject to the constraints g } (z v . . z„) ^ 0 and certain 

regularity conditions are satisfied, then the gradient method 
applied to the concavified Lagrangian with quadratic modi- 
fication, U{z v . . ., z v , Wj , . . ., w s ; p lt . . ., pJX) with X suf- 
ficiently large, converges to a saddle-point (z v . . ., z v , 

w v ...,w s ; p lt . . ., p s ) if the initial approximation is suf- 
ficiently close. 

F. The Price-Adjustment Method. 

As in section A.4, we may consider a modification of the 
gradient method of the previous section in which some of the 
variables are assumed to adjust inifinitely rapidly, or, in other 
words, to be at all points in the adjustment process at an optimal 
position given the values of the other variables. For the gradient 
method described by (43 — 44), we consider the variant where the 
z { ‘ s are supposed to maximize the Lagrangian for any given set 
of p/s, while the p/s continue to be adjusted in accordance with 

(44) . In symbols, (43) is replaced by, 

S 

(45) z v ...,z p maximize f{z v . . ., z v ) + 2 Pjgj( z i> ■ • •> z v) 

i = 1 

for given p v . . ., p s . 

(44) and (45) together constitute a dynamic system. This 
system is closer to the usual supply-and-demand model than 
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the system of the preceding section. In the present system, for 
any given set of prices, the activity levels optimal for that set 
of prices are determined. These determine, in turn, the supply 
and demand of each commodity, and the prices then move in a 
direction determined by the difference of supply and demand. 
In the system of the last section, the activity levels were not at 
an optimal level but only being varied so as to increase the 
difference between utility and costs. 

As noted in section A. 4, any system which depends upon 
instantaneous optimization with respect to some variables 
runs the risk of being undefined. At any point in the adjustment 
process where the prices still differ from equilibrium, some of 
the optimal activity levels might be infinite. With this qualifi- 
cation, however, the price-adjustment process defined by (44) 
and (45) converges satisfactorily. 

Theorem 8. If f(z v . . z 9 ) is strictly concave and 

gj (z 1} . . ., z 9 ) is concave for each /, then the price-adjustment 
process defined by (44) and (45) converges in a region suf- 
ficiently close to equilibrium so that (45) is always well-defined. 

III. CONDITIONS FOR VALIDITY OF THE NATURAL 
MARKET MECHANISMS 

In this part, we will state the assumptions about the resource 
allocation model of part I, section C, for which the global static 
and dynamic characterizations of an optimum given in Theorem 
2, 5, and 8 of part II can be applied in a straightforward fashion. 
The resulting criteria have natural economic interpretations. 
For the static characterization, the assumptions needed are that 
all the functions involved are concave; for the determination 
of dynamic processes which converge to an optimum, somewhat 
stronger conditions are needed. 

A. Static Characterization of the Optimal Allocation. 

We will first make the following two assumptions about the utili- 
ty sector, respectively, in the resource allocation model of Part. I 
sectiomC. 

(U —C) The utility function JJ (y v . . ., y n ) is a concave function. 
(P — C) The functions gu{x s ) are concave functions. 
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Assumption (P — C) is a straightforward statement of non- 
increasing returns to scale. The status of assumption (17 — C) 
is slightly more complicated. The ordinal point of view would 
deny meaning to any statement about a utility function which 
is not invariant under monotonic transformation. Concavity is 
not a property with the desired invariance; indeed, it implies 
diminishing (more strictly, non-increasing) marginal utility for 
each commodity, a well-known shibboleth for distinguishing 
the cardinalists from the ordinalists. 

The usual ordinalist assumption is that the indifference surfaces 
are convex to the origin. An alternative formulation can be 
made in terms of the following definitions: 

Definition 3. A function f(z lt . . ., z 9 ) is said to be quasi- 
concave if for any two distinct points (z 1( . . ., z v ) and 
(z [, . . ., *') and any third point (z ”, . . ., z”) such that 
z'l == dz ( -f- (1 — 6)z' t (i = 1, where 0 < 0 < 1, 

f{z ", . . ., z”) is at least as great as the smaller of the two num- 
bers f(z v . . ., z p ) and f{z [, . . ., z' v ). The function is said to 
be strictly quasi-concave if the strict inequality holds in the 
last statement. 

For the moment, we will not use the last part of the definition. 
It is easy to see geometrically that an indifference map which 
possesses a quasi-concave utility indicator indeed statisfies the 
usual convexity assumptions. 46 ) This property further is in- 
variant under monotone transformations of /. 

Now it has been shown that under certain weak regularity 
conditions, there exists for any quasi-concave function a mono- 
tone transform which is concave. 46 ) Such a function can, of 

45 ) Despite the importance of this assumption in consumer’s demand theory, 
little attention has been given to its justification. For an argument due to 
T. C. Koopmans, see K. J. Arrow, “An Extension of the Basic Theorems of 
Classical Welfare Economics,” in Proceedings of the Second Berkeley Symposium , 
pp. 529 — 30; T. C. Koopmans, Three Essays on the State of Economic Science 
(New York, Toronto, and London: McGraw-Hill, 1957), pp. 26 — 28. See also 
W. M. Gorman, “Convex Indifference Curves and Diminishing Marginal 
Utility,” Journal of Political Economy , LXV (1957), 40 — 50. 

46 ) See W. Fenchel, Convex Cones , Sets, and Functions (Department of 
Mathematics, Princeton University, 1953) (mimeographed), pp. 115 — 37; 
B. de Finetti, “Sulle stratificazaioni convesse,” Annali di Matematica Pura 
e Applicata, vol. 30 (1949), 173—83. 
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course, equally well serve as a utility indicator. The significance 
of (U — C), then, is that among the infinitely many utility 
functions which represent the indifference map of the economic 
unit, we can choose (at least) one which is concave. 

The problem is to characterize the maximization of the utility 
function, subject, to the constraints (2), with respect to the 
variables y v . . ., y n , x v . . ., x m . It is easy to verify from Defini- 
tion 1 that a function which is concave with respect to some 
members of a set of variables and where the other variables do 
not enter at all is concave with respect to all of them. From 
( U — C), then, U(y v . . .,y n ) can be regarded as a concave 
function of all the variables y v . . ., y ni x v . . ., x m . Similarly, 
gijtyj) are concave functions of the same variables. The function 
—y it is a linear and therefore concave function of all these 
variables. If we let 

m 

( 46 ) gi(y v ■ ■ •> Vn> • • •. *») = (* = !,•••> »), 

3=1 

m 

( 4 ?) gi{yv ■ ■ ■> Vn, * 1 , . . •> x m ) = 2,gij( X i)+£i (*==«+!,..., s) 

3=1 

then each of the functions g* is a sum of concave functions and 
therefore itself concave. (Note that we write as a function 
of all the ^/-variables, though in fact depends only on 2 /* if 
I i ^ n and does not depend on any of the y - variables if 
i > n.) 

With the aid of (46) and (47), (2.1) and (2.2) can be rewritten 

( 4 ») g l {Vl, ■ ■ ; Vn. *1, • • •. *J ^ 0 (*' = 1, . . S). 

The function g { is the excess of supply (both produced and natural) 
over demand (both final and interindustry). It will be referred 
to as the excess suffily. 

The problem of maximizing U(y v . . ., y n ) subject to the 
constraints (48) now satisfies the conditions of Theorem 1; the 
variables y v . . ., y ni x v . . ., x m correspond to z v . . ., z Vi the 
function U(y v . . .,y n ) to f(z v . . z 9 ), and the functions & 

to th$ functions g jt It can therefore be asserted that: 

(49) a necessary and sufficient condition that y lf . . ., y n , 
a?i, . . ., x m maximize U(y v . . ., y n ) subject to the constraints 
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(48) is that there exist numbers p x , • . p 8 such that the La- 
grangian 

L(y v . . y n , x v . . x m ; p v . . p s ) 

8 

“ U (yi , . . y n ) -f- iPigdy* • • •> Vm Z'i* • • •> 

i=i 

has (&, . . y n> x lf . . x m \ p v . . p s ) as a saddle-point. (Here, 
p i is the Lagrange multiplier associated with the constraint 
gi ^ 0.) 

To simplify notation, we shall write y for (y v . . y n ) and 
similarly with the other sets of variables. 

From its definition, the saddle-point criterion contains two 
statements: 

(50.1) the function L(y , x; p) (i.e., the Lagrangian with the 
multipliers replaced by their saddle-point values) attains its 
maximum as a function of y and x when 47 ) y — y and x = x; 

(50.2) the function L(y, x; p) attains its minimum at p = p. 
We will now give some economic interpretation to the state- 
ments (50) by considering them in more detail. First, consider 

(50.2) ; the interpretation has already appeared in the discussion 
leading up to Theorem 2. The function L(y, x; p) is linear in p. 
A mechanical application of (18) in part II, section A.5, leads 
to the following results: 

(51) gAy, x) ^ o 

(52) if g t (y, x) > 0, then p t = 0. 

(51) is a repetition of the feasibility conditions (48) or (2). 

(52) adds the condition that for any commodity for which supply 
exceeds demand at the optimum the associated Lagrange multi- 
plier is zero. 

To draw the implications of (50.1), we will replace in the 
Lagrangian by its definition as given in (46) and (47). 


47 ) V — V is an abbreviation for the set of equalities — y v y 2 = y, . . 
y« = 2/«* Similar notation is used for other sets of variables. 
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n m 

(53) L(y, x\ p) = U{y v . . y n )+2p t [ 2 £«(*,) -&+£,] 

i = 1 

S 772. 

+ 2 Pi CZgijfo) + £*] 

<=n+l i=l 

= [U (yi> ■ ■ ■ ,y n ) - ipiVz] 

i= 1 

ms s 

+ 2 C 2PiSa( x i)1 + 2Pi£i- 

3=1 8=1 i=l 

The Lagranglan has now been expressed as the sum of (m+2) 
terms. The first depends on y v . . y n , but not on any of the 
x/s. Each of the next m is of the form, 

S 

2 Pi Safa) = ™A x i)> 

i=l 

say, and depends only on one variable, the scale x j of the j th 
process. Finally, the last term is a constant. To maximize a 
sum of functions each depending on a different set of variables 
involves only maximizing each of them separately. Condition 

(50.1) can then be written, 

n 

(54. 1) the function U (y v . . y n ) —^pi y { attains its maximum 

i—i 

with respect to y lf . . y n at y,. = y t (i = 1, . . n); 

(54.2) for each /, the function ^fo) attains its maximum at 

= *#• 

The economic and institutional interpretation of the above 
criteria, particularly (52) and (54), as is by now familiar, requires 
identifying the Lagrange multiplier associated with a commodity 
restraint as the price of that commodity. Thus maybe thought 
of as the price of commodity i. Condition (52) then states that 
the equilibrium price is zero for any commodity for which there 
is an excess of supply over demand at equilibrium. Condition 
(54.1) says that the final demands are to be chosen so as to 
maximize the difference between their utility and their cost (at 
equilibrium prices). 

In discussing (54.2), first note that, by definition, nfa) is 
the profit (calculated at equilibrium prices) obtained by running 
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the / th process at scale x r Then (54.2) states that each process 
should be run at a scale which will yield maximum profit. 

Conditions (54), particularly, bring out the possibility of 
decentralization through the price system. Given the set of 
equilibrium prices, the choice of the final demands can be made 
separately from the choices of the process scales, and, further, 
the latter can be made separately from each other. Let us 
imagine that a manager is appointed for each process and that a 
helmsman is charged with choosing final demands. 48 ) For any 
given set of prices, each manager is instructed to choose that 
scale which will lead to a maximum of the profits, 

(55) ZfiigM = M x > P)- 

t=l 

At the same time, the helmsman is instructed to determine the 
level of the final demands so as to maximize the difference 
between utility and costs, U(y v . . ., y n ) — F or each 

desired commodity i, the choice of x s by each manager determines 
an output (possibly negative) g^(x^) by the j th process, while 
the helmsman's decision includes one for y i% Thus net demand 
for the desired commodity i for final and intermediate use is, 

m 

-£&#(*#); 

for the equilibrium prices, conditions (51) and (52) require that 
this net demand does not exceed the initial supply and that, if 
the two are unequal, the price p i must be zero (if the optimum 
can be reached without using the full initial supply, then the 
commodity is a free good). Similarly, for a primary commodity i , 
the net demand, 

m 

-1SiA x i). 

i = i 

must be compared with the initial supply at equilibrium, 
the net demand must not exceed the initial supply, and the price 
pi must be zero if there is an excess supply. 

« 

48 ) We follow the terminology of Koopmans, "Production as an Efficient 
Combination . . pp. 93 — 95, as developed for the linear case; however, the 
function of the helmsman is somewhat different here. 
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The elements of decentralization here are clear. For a given 
set of prices, a process manager need know only the prices 
and the technology of his own process in order to arrive at the 
optimal level for his process. The helmsman need only know the 
prices of the desired commodities and the utility function. Finally, 
the equilibrium on each market may be checked separately; 
for any given market, the test requires knowing only the net 
demand, which is an aggregate of many individual decisions, the 
initial supply, and the price. 

Theorem 9. If (U — C) and (P — C) hold, then y v . . ., y nt 
and x v . . .,x m are final demands y v . . .,y n and process levels 
x v . . ., x n which maximize the utility function U(y lr . . ., y n ) 
subject to the feasibility constraints if there exist prices 
p v . . .,p s such that the following conditions are satisfied: 

n 

(a) y v . . y n maximize the difference, U(y v . . y n )— 2,$^, 

between utility and costs; i=1 

8 

(b) for each process /, x s maximizes the profit, 

(c) for each desired commodity i, the aggregate excess of 

m 

supply over demand, 2 Sufis) + £ i — Vi> is non-negative; 
j- i 

if positive, then the price is zero. 

(d) similarly, for each primary commodity i, the aggregate 

m 

excess of supply over demand, 2 £«(**) + is non- 

3=1 

negative; if positive, then the price p 4 is zero. 

It may be noted that there may be more than one maximum 
in (a) or (b). This will usually be the case in (b) if the functions 
are linear, as in the linear programming case. In general, 
in this case, a process manager will find a range of scales, each 
of which achieves the possible profit, but only some scales will 
in fact be optimal for the economy. The choice by the manager 
of a sd&le which is not optimal for the economy as a whole will 
be revealed by a violation of one of the conditions (c) or (d). 
The validity of Theorem 9 is not affected by this remark, but 
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the force of the decentralization argument is somewhat weakened 
under these conditions. 


B. An Alternative Static Characterization of the Optimal Allocation . 

Theorem 9 as a characterization of an optimal allocation is, of 
course, closely related to the criteria of welfare economics, as 
given by Hotelling, Lange, Koopmans, Allais, Lerner, and 
Bergson. 49 ) However, condition (a) is somewhat different from 
the analogous characterization of Pareto optima when there are 
many consumers. In the latter case, each consumer is faced with 
a constrained maximization problem, that of maximizing the 
utility function subject to a budget restraint. Since we are here 
treating the special case where there is only one consumer, there 
should be and is an analogous theorem, with (a) replaced by a 
constrained maximization. 

Let M = ^l^iPiVty the total expenditure of the helmsman at 
equilibrium. Suppose he is now told to maximize utility subject 
to the constraint that his expenditures be i£f, i.e., 

(56) 2Piyi = M. 

1=1 

Since the result of a maximization is not affected by changing 
the maximand by a constant, the problem is equivalent to 
maximizing U(y v . . y n )—M = U(y v . . «/„)— sub- 
ject to the constraint (56). But we already know that the un- 
conditional maximum for the last maximization is 
( i = 1, . . ., n). Since this solution satisfies the constraint (56), 
it must also be the constrained maximum. Hence (a) is equivalent 
to, 

(57) y v . . .,y n maximize U{y 1> . . ., y n ) subject to the con- 
straint (56). 


49 ) See H. Hotelling, “The General Welfare in Relation to Problems of 
Taxation and of Railway and Utility Rates,” Econometrica, Vol. 6 (1938), 
242 — 67; Lange, “Foundations,...” op. cit.; Koopmans, ch. 1 in Three Essays, 
op. cit .; M. Allais, Traite &' Economic Pure (Paris: Imprimerie National^, 1943) 
III, 604 — 82; Lerner, op. cit.\ A. Bergson (Burk), “A Reformulation of Certain 
Aspects of Welfare Economics,” Quarterly Journal of Economics, VIII (1938), 
310 — 334. 
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So far, this is not very interesting since the condition (57) 
requires a knowledge of M. Now let us recall the homogeneity 
properties of demand and supply functions derived from utility 
— and profit — maximization. The behavior described in (57) 
is that of the consumer in the usual theory of demand. If M 
is changed to any value M and the prices changed in the same 
proportion, the choice of the y/s is unaffected. At the same time, 
the choice of x 3 is unaffected by a change of all prices in the same 
proportion. Hence, if M is replaced by any number M, we can 
choose the price so that (57) and (b) of Theorem 9 are satisfied. 
In (c) and (d), the prices enter directly only in the form of 
conditions for zero prices; but a change of all prices in the same 
proportion leaves the zero prices unaltered. We can state, 
Theorem 9'. Theorem 9 remains valid if (a) is replaced by, 
(a ')y v . . ., y n maximize U (y lf . . y n ) subjects to the budget 
restraint, Pi Vi = M, where M can be chosen arbitrarily. 

The constrained maximum (a') is, as usual in consumers' 
demand theory, invariant under monotonic transformations of 
the utility function. It can thus be shown that Theorem 9', 
unlike Theorem 9, remains valid if the utility function is to be 
merely quasi-concave, rather than concave. 


C. The Economic Meaning of the Gradient Method. 

We will now apply the gradient methods of section E, part II 
to the resource allocation model of part I, section C. The for- 
mulation is parallel to the application of Theorem 2 as given 
in section A of this part. The differential equations (43) and (44) 
then become. 


(58) 

(59) 

(60) 


dv.ldt = 1° if Vi “ ° and ( dU / dy ^ ~Pi< 0 ’ 

* 1( dU/dyi ) — pi otherwise; 


dxjjdt 

(0 

dpjdt = 


|0 if Xj ■■ 
\d7Zjl dXj 


= 0 and djtj/dxj < 0, 
otherwise; 

0 if pi = 0 and g { > 0, 
g t otherwise. 


In (59), n j is the profit evaluated at current prices, as defined 
in (55). 
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These equations can be obtained by directly substituting the 
particular maximand and constraints of the resource allocation 
problem into the general gradient method. It is perhaps more 
illuminating to notice that (58) and (59) are direct dynamic 
counterparts of the conditions (a) and (b) of Theorem 9. Once 
the problem of the helmsman has been stated as an unconstrained 
maximization, it is natural to transform the static condition of 
finding the maximum into the gradient method for approaching 
it; this is precisely (58) (compare section A.3, part II). Equation 
(59) has precisely the same relation to condition (b) as (58) has 
to (a). 

But of course we now assume that the maximization processes 
of (58) and (59) are being applied at any set of prices taken as 
given to the helmsman and the managers, not merely at the 
equihbrium set of prices. As the final demands y t and process 
scales Xj are varied in accordance with (58) and (59) respectively, 
the prices which the helmsman and the process managers take 
as given are themselves being varied in accordance with (60). 
The meaning of (60) has already been explored in part II, 
section E. 

Thus we see that the gradient method may be given the 
following institutional interpretation. The helmsman, taking the 
prices of desired commodities as given, changes each final 
demand at a rate equal to the difference between marginal 
utility and price, except that if the final demand for any com- 
modity is zero and the marginal utility is less than the price, 
the final demand remains at zero (since negative final demands 
have no meaning). Hence the final demand for a commodity is 
increased if marginal utility exceeds price and, if not already 
zero, decreased if marginal utility is less than price. For each 
process, the manager, taking all prices as given, changes the scale 
of his process at a rate proportional to its marginal profitability, 
except that if the scale is zero and the marginal profitability 
negative, the scale remains at zero. Thus a process increases 
in scale if the marginal profitability of expansion is positive 
and decreases in the opposite case if not already zerof 

The choice by a manager of his process scale determines the 
output or input of each commodity in the process. For any one 
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commodity, total the outputs (taking inputs as negative out- 
puts) for all processes, add the initial supply and, in the case 
of a desired commodity, subtract the final demand determined 
by the helmsman. The result is the excess suffily, denoted by 
g if of the desired and primary commodities respectively. We may 
imagine for each commodity a custodian (again in Koopmans's 
terminology) who varies the price of the commodity at a rate 
proportional but opposite in sign to the excess supply, with the 
qualification that if the price is zero and the excess supply 
positive the price remains at zero. This instruction is the well- 
known “law of supply and demand”: the price of a commodity 
rises if demand exceeds supply, falls in the opposite case. 

The instructions to the helmsman, the managers, and the 
custodians represent a high degree of decentralization of the 
information-gathering and decision-making functions. The helms- 
man need only know the utility function and the prices of desired 
commodities to carry out his rule (58). The manager of a process 
need know only its technology, as represented by the functions 
giA x j)> an( i prices of the commodities entering it. A custodian 
need only know the aggregate difference between demand and 
supply on his market (not the offers and demands of individual 
processes or of the helmsman) and the price of his commodity. 


D. The Conditions for Validity of the Gradient Method . 

The solution of the system of differential equations (58 — 60) 
will not converge for any arbitrary set of functions U(y v . . ., y n ) 
and Not even the concavity of all these functions suffices. 

As was seen in part II, section D, if all the functions involved 
are linear, the gradient method will lead to indefinite oscillations 
in some or all of the variables with no convergence to the op- 
timum solution. 

Theorem 5 supplies a sufficient condition for convergence in 
the general constrained maximization problem but one that, 
unfortunately, is not applicable to the resource allocation 
problem. The reason for inapplicability is that the maximand 
U(y v . . ., y n ) does not contain all the variables with respect to 
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which it is maximized and hence cannot be strictly concave in 
all the variables. 

To obtain the desired results we will now strengthen assump- 
tion ( U — C) to, 

{U — SC) U(y 1 , . . ., y n ) is strictly concave in y x , . . y n . 

The justification for ( U — SC) is similar to that for ( U — C). 
In consumers’ demand theory it is usual to assume not merely 
that the indifference surfaces are convex to the axes but also 
that they are not linear or planar and have no linear segments 
in them. This requirement insures that the demand functions 
are well-defined (i.e., single-valued) since, for any budget plane, 
the indifference surface tangent to it has only one point in 
common with it. We may state this as a requirement on the 
utility indicator in the terminology of Definition 3. 

(U — SQC) U(y lt . . ., y n ) is a strictly quasi-concave function. 

Under suitable regularity conditions 50 ), if ( U — SQC) holds, 
there is a monotonic transform which is strictly concave. There- 
fore assumption ( U — SC) means essentially that a suitable one 
of the utility indicators has been chosen in the definition of 
condition (58) of the gradient method. 

While assumption (U — SC) is not sufficiently strong to insure 
that all the variables will converge to their optimal values when 
the gradient method is applied, it is sufficient to insure that the 
most important ones do. 

Theorem 10. If (U — SC) and (P — C) hold, and the 
variables y t , x jt p { are varied in accordance with the gradient 
method defined by equations (58 — 60), then, for each desired 
commodity i, the final demand y t and the price ■p i converge 
to their optimal values and § t ( i — 1 , . . ., n). The process 
scales Xj and the prices of the primary commodities 
(i = n + 1, . . ., s) may oscillate indefinitely but cannot 
diverge. 

Further, for any desired commodity for which the optimal 
final demand is positive, the difference between supply and 
demand approaches zero. In symbols, for any i for which y { > 0, 

m r 

lim lVi -lSiA x i) ~ £i] = 0. 

t — s^oo i=l 

60 ) See footnote 46. 
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Although Theorem 10 assures convergence in the most im- 
portant variables, it is not completely satisfactory. The scales 
of the processes must, in the limit, supply the equilibrium final 
demands, but it is not guaranteed that they will satisfy the 
feasibility constraints with respect to primary commodities. 
The process scales may instead oscillate indefinitely in such a 
way that the demand by processes for a primary commodity 
oscillates indefinitely about the initial supply. Examples can 
be given to show that this possibility is not ruled out by any 
assumption thus far made. 

As might be expected, the possibility of oscillations is con- 
nected with linearity of the production processes. We can make 
the following 

Remark. Even if some of the processes are linear, oscillations in 
the process scales can occur only “by accident”, in the sense 
that special relations must hold among the input-output coef- 
ficients which define the different linear processes. 

It follows that the gradient method in general provides a 
satisfactory solution of the resource allocation problem under the 
assumptions of Theorem 10. 

In any case, the absence of linearity is a sufficient condition 
for convergence in all the variables. We need to assume a 
sharpened version of (P — - C); instead of merely postulating 
non-increasing returns, we will wish to assume strictly diminishing 
returns. In mathematical terms, this means that the functions 
gijiXj) defining a process j are all strictly concave. 51 ) However, 
we do not want to assume that the functions g iS (x s ) are strictly 
concave for all i and j because that would imply that every 

61 ) The functions g i7 (x f ) are here considered to be strictly concave as functions 
of the process scale x f . In nonlinear processes, there may be more than one natural 
definition of a scale. From one point of view, any strictly monotonic transfor- 
mation of a scale is itself a scale, since it serves equally well as a parameter 
in defining the different levels of the process. However, the previous theorems 
have shown that there is an advantage in choosing the scale so that the functions 
giffci) are concave. If this condition does not uniquely specify the choice of 
a scale, then the scale can be so chosen that all the functions g if which enter 
the process can be chosen strictly concave. 

It may T>e worth remarking that the condition that it be possible to choose 
strictly concave input and output functions is stronger than the condition 
that the process not be linear. The first condition requires that the process 
have no linear relations between any input and any output. 
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commodity enters into every process, either as input or as 
output. We therefore assume the following less stringent sharpen- 
ing of (P — C): (P — SC). For each commodity i and process j, 
the function g i} {x } ) is either strictly concave or identically zero. 
We can then state, 

Theorem 11. If ( U — SC) and (P — SC) hold, and the 
variables y it x s , and f>, are varied in accordance with the 
gradient method defined by equations (58 — 60), all variables 
converge to their equilibrium values. 52 ) 

E. The Price-Adjustment Method. 

We will now apply the price-adjustment method presented for 
constrained maximum problems in part II, section F, to the 
resource allocation problem. The formulation is obvious; the 
price-adjustment is the limiting form of the gradient method 
of (58 — 60), where the differential equations (58) and (59) are 
replaced by the conditions that for any given set of prices the 
helmsman chooses that set of final demands which maximizes 
the difference between utility and costs and the manager of 
each process chooses that scale which maximizes profits (55). 
The totality of these decisions determines supply and demand 
on each market; the custodian varies price in accordance 
with (60). 

It must be recalled, as already seen in part II, section F, 
that the instructions to the managers to choose maximizing 
values for their scales implies that there exists a maximum and 
that it is unique. The first condition is satisfied when prices are 
sufficiently close to their equilibrium values but not necessarily 
•everywhere; the second condition requires that the profit 
function is strictly concave, which is guaranteed if (P — SC) 
holds. Similarly, the helmsman has a unique maximum in the final 
■demands when prices are close to equilibrium and the utility 
function is strictly concave. We shall therefore assume that 
(P — SC) and ( U — SC) are both valid. 

For any given set of prices, p lt . . p s , let the helmsman choose 
y v ...,y n so as to maximize U {y lt . . ., y n ) — 22* p { y the 

52 ) Mathematical proofs of the results in this section will be found in K. 
.Arrow and L. Hurwicz, “Gradient Method for Resource Allocation/’ op . tit. 
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manager of the j th process choose x } so as to maximize n 3 (x } ). 
Denote the choice of helmsman as a function of the p/s by, 

Vl(Pl> • • ->Pn)> • • ;Vn(Pl> ■ • ;Pn)- 

ViiPi: ■ ■ •> Pn) is the final demand function for the i tn desired 
commodity. The value of x } chosen by a manager determines 
the net output, of the i th commodity by the j th process; 

as a function of the prices, it may be denoted by y tj (p lt p s ), 
the supply (demand if negative) of the i th commodity by the 
7 th process. The aggregate excess supply for the i th commodity, 
as a function of prices, is then, 

m 

(61.1) YiipL . . ., Pa)='2,Yij{Pl< • • ■> Ps)-\-£i— Vi(Pl> • • •. Pn) 

3=1 (i = 1, • • n ); 

m 

(61.2) y t {fa . . ., p a )=J,y ij {p v . . ., p s )+£ { [i = n+ 1, . . ., s). 

i=i 

In this notation, the equilibrium conditions for an optimal 
allocation, as given in Theorem, 9 are, 

(62) ytipi, . . ., p s ) ^0 (i = 1, . . ., s); for any i such that 

Yi(Pl> ■ ■ ■> Pa) > 0 , pi = 0 , 

that is, the equalization of supply and demand with the necessary 
qualification for free commodities. The equilibrium values of 
final demands and process scales are determined from the equi- 
librium prices by the maximization that determined the supply 
and demand functions. y { = y { (p v . . ., p n ), and is the profit- 
maximizing choice which determined the functions y i} , evaluated 
for equilibrium prices. 

The dynamics of the price-adjustment method are then simply 
stated: 


(63) 


ifjdt - (° “ f< = 0 “? ^ \ ■ - « > o. 
\— Yi(Pi> • • •. Pa) otherwise. 


Since the price-adjustment method is simply a limiting form 
of the gradient method, the conditions for its validity are the 
same. • 

Theorem 12. The price-adjustment method defined by (63) 
converges under assumptions (U — SC) and ( P — SC). 
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F. Remarks on an Alternative Form of the Price-Adjustment 

Method. 

The static characterization of an optimum given in section B 
suggests the following alternative form of the price-adjustment 
method. The supply and demand functions of the processes, 
Ytj{pv . . p a ), are generated in the same way as in part E, 
but the final demand function, rj { {p lt . . ., p n ), are defined by 
the maximization of U (y v . . ., y„) subject to the budget restraint, 
PiVi — M. (The dependence of the final demand rj { on M 
is not indicated in the notation, since M is taken as a parameter 
and remains fixed throughout the adjustment process.) Then the 
definition (61) of excess supply is altered only by replacing 
ViiP v ■ • •> Pn) in (61.1) by . . ., j > „); let ftfo, . . ., p a ) be 

the excess supply in the new system. Finally the dynamics are 
supplied by (62), with ...,p 8 ) replaced by ?<(&, ...,&). 

In this form, the system is a special case of a competitive 
economy, in which the productive units maximize profits at 
any given level of prices, the consumers (in this case there is 
only one) maximize utility at any given level of prices, and 
prices vary according to the law of supply and demand. This 
is the model sketched by Walras and formulated more precisely 
by Samuelson. The conditions under which the equilibrium of 
such an economy is stable are by no means well known, but a 
number of sufficient conditions have been established. 53 ) 

IV. MODIFIED MARKET MECHANISMS IN ABSENCE OF 
DECREASING RETURNS 

In this part, we will treat the modifications of the market 
mechanism which might cope with the problems of resource 
allocation arising under increasing or constant returns. The 
treatment must, of course, be much more tentative. The pos- 
sibility of allocative mechanisms which achieve optimal allocation 
under increasing returns and still have some measure of decen- 
tralization is only beginning to be explored. 

58 ) See K. Arrow and L. Hurwicz, ‘ 1 On the Stability of Competitive Equilibrium 
I,” Econometrica, Vol. 26 (1958), 522-552; K. Arrow, H. D. Block, and L. 
Hurwicz, “On the Stability of Competitive Equilibrium II/* ibid , Vol. 27 
(1959), 82-109. 
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A. General Remarks. 

As we have seen in several places in part II, the absence of 
concavity conditions on the functions involved has two con- 
sequences for the characterization of maxima (constrained or 
unconstrained): the first-order conditions do not completely 
distinguish maxima from other stationary points, and in any 
case do not in any way distinguish global from merely local 
maxima. 84 ) The latter problem cannot be dealt with by any 
static characterization which is based on derivatives. Correspond- 
ingly, no variation of the gradient method, which is based on 
moving uphill as measured solely by local variations, can be 
expected to insure arrival at the highest of several peaks; at 
best, only convergence to a local maximum can be expected. 

A simple illustration is provided by an economy with one 
output, one input, and two processes both operating under 
increasing returns. Let x t be the input into the i Xb process, 
/i(a; £ ) the output of the i tb process, and g the total am ount of 
input available, so that x t + x 2 — g. We assume /(.(»<) > 0 
ft (%i) > 0. We wish to maximize y(a; x ) — / x (a; x ) + f 2 (g — « x ). 
We then have 

V( x i) = /£(* 1) - /»(f - %)» 
v"i x i) = fi( x i) + /* (? ~ «i)- 

We locally maximize y(a; x ) subject to 0 ^ x y ^ g. If the maxi- 
mizer satisfies 0 < £ x < 1, then we must have y'(a! x ) = 0, 
y"{x\) ^ 0. But under the assumptions made y"(a: x ) > 0 for 
all x v Hence either x 1 = 0 (and x 2 = g) or — g (and x 2 = 0); 
in any case one of the two processes is at zero level. 

Now suppose that /((£) > f' 2 (0) and also f' 2 (g) > /((0). Then 
y>' (g) > 0 and ip' (0) < 0, so that x 1 = g and x 1 = 0 are both 
local maxima. (That this can happen is seen from the example 
fii x i) — x i x v a i > 0.) If / x (f) ^ / 2 (l), only one of the two local 
maxima is a global maximum. (Take a x ^ a 2 in the preceding 
example.) Hence we may expect to encounter cases of local 
maxima which are not global. 

We will suppose from now on that we know in a general way 

64 ) See part II, sections A. I and A. 5, and the Remark to Theorem 1 in 
section B 
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where the global maximum is; the dynamic problem will be that 
of locating it precisely. Within the framework of gradient 
methods, nothing better can be expected. 

In any case, we will want to seek processes that do not converge 
to stationary points that are not even local maxima, while if we 
do start in the neighborhood of a local maximum we wish our 
approximation methods to converge to it. These, then, are the 
criteria by which we will judge modifications of the market 
mechanisms which seek to yield optimal resource allocations under 
increasing returns. Our mathematical tools are the static 
characterizations of part II, section C, and their dynamic 
counterparts, Theorems 6 and 7 of part II, section E. 

B. The Unmodified Lagrangian Conditions. 

For static conditions for an optimal allocation, we start with 
the Kuhn-Tucker Theorem 1, which has in fact been the basis 
of discussions of optimal allocation to date. The application of 
Theorem 1 to the resource allocation problem is entirely parallel 
to that of Theorem 2 in the case where the concavity conditions 
necessary for the latter, as given in Theorem 9. Each of the four 
conditions there has a parallel in the more general case. 

The analogue of condition (a) is that, 

(64) U V( <; pi ( i = 1 ,...,«); if U Vt < pi, then y { = 0. 

If we retain the concavity conditions (U — C) for the utility 
function, then the relations (64) are necessary and sufficient 
for a maximum of the difference, U(y v . . y n ) - 2 > so 
that condition (a) of Theorem 9 remains valid. 

The analogue of condition (b) is that, 

(65) dnJdXj iS 0 for x } = xp, if djiJdXj < 0, then x } = 0. 

In (65), the function n^x,) is the profit, calculated at equi- 
librium prices, attached to the j th process, as defined just 
preceding equations (54). 

Finally, conditions (c) and (d) remain valid. 

Theorem 13. If (U — C) holds, then the following con- 
ditions are necessary for an optimal resource allocation: 

(a) the helmsman chooses the final demands y lt . . .,y n so 



DECENTRALIZATION AND COMPUTATION IN RESOURCE ALLOCATION 89 


as to maximize the difference, U(y v . . ., y n ) — 2JLi fiiVi, 
between utility and costs calculated at equilibrium prices; 

(b) each process manager chooses a process scale x 0 such that 
dnjdxj fg 0, but the strict inequality can only hold if 

Xj = 0; 

(c) for each commodity i (i = 1, . . ., $), the excess supply 
g { evaluated at equilibrium final demands and process 
scales must be non-negative; if positive, the price p~0. 55 ) 

The difference in conditions for optimal resource allocation 
between the general case discussed here and that in which 
concavity assumptions are made about production as well as 
consumption is solely that between condition (b) of Theorem 13 
and condition (b) of Theorem 9. The former requires that at 
equilibrium prices the marginal profitability of each process be 
non-positive; it can be negative only if the process is operated 
at zero scale. The last clause is perhaps more intuitively meaning- 
ful in its contrapositive form: 

(66) if the marginal profitability is positive at zero output, 
then the optimal value of the process scale must be 
positive. 

In a general way, the non-negativity clause has been recognized 
in the literature; it is well known that one of the chief problems 
in optimal resource allocation under increasing returns is determin- 
ing which processes are operated at some positive level. Statement 
(66) supplies a sufficient, though not necessary, condition. 

The parallelism between Theorems 9 and 13 suggests a supply- 
and-demand interpretation. The demand functions of the 
helmsman are defined precisely as before. For a process manager, 
however, the supply or demand for each product is defined as a 
function of all prices by first choosing the process scale x i so 
that marginal profitability is zero at the given set of prices 
(or negative at zero scale) and then finding the corresponding 
values of g {j (a^). The equilibrium prices are then defined in the 

fi5 ) It may be remarked that, analogously to Theorem 9, Theorem 13 remains 
valid v$ien condition (a) is replaced by condition (a') of Theorem 9'; see part 
III, section B. Theorem 13 is identical with the propositions of Hotelling, Lange, 
and Allais, op. cit. footnote 43, except for the emphasis on the non-negativity 
conditions. 
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usual way as those which equate supply and demand (or lead 
to an excess supply with zero price). 56 ) This interpretation is 
not as useful as the corresponding one in the case where the 
concavity conditions on production hold. First, for any given 
set of prices there is generally more than one value of the process 
scale which satisfies the stated conditions. Under increasing 
returns, the marginal profitability can easily increase from a 
negative value at zero output through zero to positive values. 
Then both zero and the value of the process scale which makes 
the marginal profitability zero satisfy the condition. If there is 
ultimately a phase of diminishing returns, the marginal profi- 
tability may again become zero, so there can be three possible 
values of the process scale. Even more complicated possibilities 
cannot be excluded. The corresponding supply and demand 
functions are thus multi-valued and not well defined. 

A second difficulty closely related to the first is that the 
system as a whole will in general have multiple equilibria. 57 ) 
We have already seen that this phenomenon is usual in applica- 
tions of Theorem 1; see the Remark following. Some of these 
equilibria will not even be local optima. 58 ) 

If we specify in advance a neighborhood of the optimal 
allocation and consider only alternatives within it, these two 
difficulties will in general disappear, and Theorem 13 provides 
a satisfactory static characterization. The next problem is the 
determination of a dynamic approximation method which 
converges to the optimum and is consistent with decentralization. 
Since the proper aim of a process manager may be to minimize 
rather than maximize profits, the dynamic system of Theorem 10 
cannot apply. The above supply-and-demand interpretation 
suggests a model in which the dynamic element is simply the 

66 ) This formulation is employed by Lange, Socialism, pp. 81 — 82. 

B7 ) This possibility is admitted by Lange but regarded as exceptional; see 
Socialism, pp. 82, 69 — 70. 

B8 ) There is a third problem, of a type which we have not stressed in this 
paper. If the optimal process scale is such that the marginal profitability is 
changing from negative values to positive ones, then the position is one of 
minimum, not maximum profits. While this is not a difficulty from the point 
of view of formal rules of operation, the incentives to the process manager are 
clearly a good deal less satisfactory than in the concave case where social optima 
correspond to profit maxima. 
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adjustment of prices according to supply and demand. The 
simplest form is the system studied in Theorem 12, with the new 
interpretation given to the supply and demand functions of 
processes; in this system price changes at a rate proportional 
to the negative of excess supply. We will show by example that 
such a dynamic system can be unstable. However, in section E 
below we will show that the price-adjustment model will be 
stable with more complicated rules for price changes. 

Let there be two commodities and one process. Co mm odity 1 
is a desired commodity with no initial supplies. Co mm odity 2 
is a primary commodity with an initial supply of 1. The one 
process has commodity 2 as input and commodity 1 as output. 
If we use the input as the process scale, x v the output is assumed 
to be x\/2, so that the process displays increasing returns. We 
assume that the single desired commodity 1 is desired at all 
levels. Then from an ordinalist point of view the utility function 
can be any strictly increasing function of the final demand y v 
We choose in particular a strictly concave utility function, 

(67) U(y 1 ) = (logy 1 )l2. 

In our notation, the above model can be written 

(66) gii( x i) — x il^> §21 (*i) = x v 

(69) = 0, | 2 = 1 . 

Let us derive the supply and demand functions in the notation 
of part III, section E. First, the demand function of the helms- 
man is obtained by maximizing the difference, U(y 1 )—j)- [ y 1 , 
so that 

(70) rjM = 1/2jV 

The process manager chooses x 1 to make the marginal profitabili- 
ty zero. Since a^fe) = p 1 g 11 ( x i) + p 2 Szii x i) = (M?/ 2 ) - p 2 x v 
dn 1 /dx 1 = p 1 x 1 — p 2 — 0, so that x 1 = pjp x . This choice of 
process scale generates the following excess supply functions for 
the two commodities by substitution into (68): 

(71) . YntPv P») = mi Vn(P» P 2 ) = ~<PJP 1 ). 

The optimal allocation is obvious by inspection; since the 
output is always desired and is an increasing function of the 
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input, we simply set x x = 1 and therefore y 1 = 1/2. If we start 
in a neighborhood of the optimum, we can disregard non- 
negativity considerations. The price-adjustment model analogous 
to that of part III, section E, becomes, 

(72) dpjdt = p 2 ) = *7i(Al) — yn(^i> ^ 2 ) — 

(73) = —YziPv Pz) = —YziiPv Pz) ~ £2 
= (M>i) - I- 

The equilibrium point of this system can be found by setting 
the right-hand sides equal to zero, that is, yi(p v p 2 ) = 0, 
yz(§i>Pz) = 0. From (73 ),p 1 = ^ 2 at equilibrium; by substitution 
into (72), we see that p x = p 2 = 1. To investigate the local 
stability of (72 — 73). we find the matrix of partial derivatives 
of the right-hand sides of (72 — 73) with respect to p± and p 2> 
evaluated at the equilibrium point; the condition for stability 
is that the real parts of the characteristic roots of this matrix 
be negative. 59 ) Straightforward calculation shows that the 
matrix is, 



The characteristic equation is a quadratic with roots [3 ± Vl7]/4, 
so that one characteristic root is positive, and the system 
(72 — 73) is unstable. 60 ) 

C. A Digression on Marginal-Cost Pricing. 

Suppose that the j th process has a single output, say com- 

59 ) See Samuelson, op. at., pp. 270 — 74. 

60 ) There is another gradient method which corresponds to Theorem 13 
(or, more generally, to Theorem 1) and which does converge in general to a 
local maximum. It is defined by moving in a direction which is as close to the 
gradient of the unconstrained maximization as possible compatible with 
satisfying the constraints at all times. In the resource allocation model, this 
would require that the excess supply for each commodity would have to be 
zero (apart from free goods) at all points in the approximation process, not 
merely at equilibrium. There seems no way of satisfying this requirement in a 
decentralized fashion. For this gradient process, see G. E. Forsythe, “ Computing 
Constrained Minima with Lagrange Multipliers,’ ' Journal of the Society for 
Industrial and Applied Mathematics, Vol. 3 (1955), 173 — 78; Arrow and Solow, 
op. cit., sec. 3. 
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modity 1, and that the scale of the process is taken to be that 
output. Then g l3 [x 3 ) = x jf and — g i3 (x 3 -) is the input of commodity 
i needed to produce one unit of commodity 1 (i = 2, . . s). 

Clearly, 

(74) dSiJdx, =h~i hl—g'u (*,)]. 

Since —g'^ix^) is the increase in input i per unit increase in 
output, (74) may be interpreted in familiar fashion; the marginal 
profitability equals price less marginal cost. Condition (b) then 
says that at the optimum price equals marginal cost, except 
that for a process not operated at all price may be less than 
marginal cost. 61 ) 

We have thus the familiar rule of marginal cost pricing as a 
condition for optimal allocation under increasing as well as 
diminishing returns, that is, that for a process operated at some 
positive level the output should be chosen so that the marginal 
cost equals the price. Under diminishing returns, the marginal 
cost curve will be rising, and the rule then chooses the point of 
maximum profit. Under increasing returns, however, the marginal 
cost may be falling, and the (socially) optimal output will yield 
minimum profit. 

Apart from the non-negativity condition, then, the usual 
marginal-cost-pricing condition is correct when applied, as in 
our model, to individual processes. But the condition is usually 
applied to firms, where a firm, in our formulation, is a single 
decision-making unit having control over several processes. In 
this form, it is not correct, contrary to prevailing opinion, that 
the optimal allocation corresponds in general to an equality of 
price and marginal cost, as the latter term is usually defined. 

To see this, recall that the cost function is defined in economic 

61 ) Actually, this interpretation is not dependent upon identifying the scale 
of process with the output. We can always write, 

s 

dftfldx, = g'^x,) {pi - 2 [& - gui^lg'u^i)]}- 
i=2 

The rateo, — gufojlgvfa), is clearly the marginal increase in input i for a 
unit increase in output; hence the term in braces is the difference between 
price and marginal cost. Since the factor, is necessarily positive, the 

statement in the text is always equivalent to condition (b). 
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literature as the minimum cost of producing any given output, 
prices being taken as given. 62 ) Marginal cost is then simply the 
derivative of the cost function with respect to output. 

The usual rule of equaling marginal cost to price, then, implies 
that the output of the firm is produced at minimum cost. Despite 
the reasonable sound of this statement, it is not, in general, a 
correct rule for optimal resource allocation. Consider for example 
a firm which has two processes with the same output but dif- 
ferent inputs, each process showing increasing returns. Let 
fiiVi) be the amount of input 1 required to produce amount 
y x of the output by process 1, / 2 (y 2 ) the amount of input 2 required 
to produce amount y 2 of the output by process 2. Let and | 2 
be the amounts of inputs 1 and 2 initially available. We assume 
increasing returns, so that f"(y x ) < 0, f 2 {y 2 ) < 0. The rule of 
m ini miz in g cost at any given output requires the form to choose 
y x and y 2 so as to minimize p x f x {y x ) + p 2 f 2 {y 2 ) subject to the 
constraint, y x + y 2 = y, where p x and p 2 are the prices of inputs 
1 and 2 respectively. This is equivalent to minimizing, 

wiVi) = PifiiVi) + PMv ~ Vi). 
with respect to y x subject to the inequality constraints O^y^y. 
By differentiation, y>"(y x ) < 0 throughout the interval, so that 
the minimum must occur at either y x = 0 or y x = y } i.e., y 2 = 0. 
Thus, no matter what values are assigned for p X) p 2 > and y, one 
of the two inequalities, 

fi(Vi) < fu / 2(2/2) < 

must hold. 

On the other hand socially optimal allocation requires maxi- 
mizing total output y x + y 2 subject to the constraints. 


62 ) For representative texts, see G. J. Stigler, The Theory of Price, (rev. 
ed.; New York: Macmillan, 1952) pp. 127 — 29; K. E. Boulding, Economic 
Analysis, (3rd ed.; New York: Harpers, 1955) ch. 34. This definition of cost 
is clearly the correct one in the theory of the firm, whether under competition 
or under monopoly; maximization of profit requires maximizing the difference 
between revenue and cost as defined in the text, or, equivalently, the equating 
of marginal revenue and marginal cost. * 

Writers who have dealt with optimal allocation under increasing returns 
have usually not been careful to define marginal costs; however, Lange ex- 
plicitly presents the definition given in the text (pp. 116 — 17). 
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fiit/i) ^ fi, / 2 (y 2 ) ^ It is obvious that the maximum is 
attained by choosing y 1 and so that, 

fi(Vi) ~ £i> f 2 ( 2 ^ 2 ) = ^2* 

Therefore it is impossible that the rule of cost minimization lead 
in this case to an optimal allocation of resources. The minimum 
cost solution is not socially efficient because it leaves unemployed 
resources which have positive usefulness, no matter how factor 
prices are chosen. 

D. Optimal Allocation Through Imperfect Competition . 

To deal with the drawbacks of Theorem 1 as a tool for deter- 
mining optimal resource allocation under increasing returns, we 
will turn to Theorems 3 and 4 which yield local saddle-point 
characterizations of the optimal allocation. Unlike Theorem 1, 
these conditions are satisfied only by local maxima, not by 
other extrema. 

First, consider the application of Theorem 3 to the resource 
application problem. The concavified Lagrangian with trans- 
formed constraints becomes, 

(75) L*(y v . . y n , x v . . ., x m \ p v . . ., pjtj) 

= TJ{y 1 , . . y n ) + i ti[ 1 - (1 - g<) 1+ H 
1=1 

where the excess supply g { = g t {, x v . . x m , y v . . y„) is defined 
by (46) and (47). 

The optimal allocation is characterized as a local saddle- 
point. Since L* is linear in the p/s, with respect to which there 
is a minimum, we have, 

(76) 1 — (1 — gi) 1+Vt ^ 0; for any i for which the strict 
inequality holds, pi = 0. 

Here, g ( is the excess supply at equilibrium, that is, 

&(*!, • • x m , y x , . . ., y n ). However, it is easy to see that the 
expression, 

1 - (1 - ZiY +r ", 

is positive, zero, or negative according as g ( is positive, zero, or 
negative; indeed it was originally chosen to have this property 
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(see remarks preceding (37) in part II, section C). Hence (76) 
is equivalent to, 

(77) gi ^ 0; if gi > 0, then = 0, 

as was to be expected. 

Let us combine the maximization part of the saddle-point 
criterion with (77). 

Theorem 14. Under certain regularity conditions, a neces- 
sary and sufficient condition for a locally optimal allocation 

is that, 

(a) y v . . y n , x v . . ., x m maximize the expression, U [y v . . .,y n ) 

+ - [1 - gifri, . . ; X m , y x , . . ., 2/J] 1+ ’*); 

Z=1 

(b) g t {xx> • • •> x m , y t , . . ., y n ) ^ 0; if g { (x v . . x m , 

Vi> ■ ■ ■> Vn) > then = 0, where the rj t ’ s have been 
chosen sufficiently large. 

Condition (a) can be given an institutional interpretation. 
Suppose the helmsman takes over the functions of the process 
managers in addition to his own, so that he chooses the x/s 
as well as the y t ’ s. At the same time, suppose that the custodian 
for commodity i purchases any amount u i available for an 
amount, ^[1 — (1 — This function is a total revenue 

function in the i th market. The helmsman then maximizes the sum 
of his utility and the revenue he obtains by selling the excess 
supply (the “revenue” would, of course, be negative if there 
were an excess demand). The helmsman on this interpretation 
is dealing with a series of markets on which he is an monopolist 
or monopsonist. 

In this formulation, the pi s do not play the role of com- 
petitive prices, but they are parameters which affect the location, 
though not the general shape, of the revenue functions. We will 
therefore refer to them as revenue parameters. Condition (b) 
insures that at equilibrium the revenue parameters are chosen 
so that the helmsman will have zero excess supply for free goods 
(when ^ — 0, the revenue is identically zero). 

So far the institutional interpretation has been highly central- 
ized. All that has been done is to rephrase the constrained 
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maximization problem as two problems, one an unconstrained 
maximization and one a feasibility condition. However, any 
unconstrained maximization has a certain element of decen- 
tralization implicit in it. If z v . . ., z p maximize a function 
f(z v . . ., z p ), then it is true that z x maximizes the. function 
f{z i, z 2 , . . ., z 9 ) as a function of the single variable z v and 
similarly with the other z/s. In the case of Theorem 9, there was 
an extreme simplification in the unconstrained maximization 
(given the p/ s), since the function to be maximized was a sum 
of functions each involving a different variable. The maximiza- 
tion in Theorem 14 is not so simple, but along the lines just 
suggested there is a degree of decentralization which, as we shall 
see, becomes more pronounced in the dynamic form. 

Let, 

(78) h ti = gi — g if (i = 1, . . s; / = 1 m ); 

(79) h i = g i + y i (i=l,...,n). 

Reference to (46 — 47) shows that h if is independent of x s , though 
it depends on the scales of all processes other than the and, 
in the case of desired commodities, on y { . Similarly, hi depends 
on the process scales but is independent of the y t ‘ s. If, following 
the argument above, we maximize with respect to x } , taking 
the yl s and all process scales other than the j th as given at their 
equilibrium values, we have that, 

(80) x } maximizes — [1 — — &,(*,) ] 1+? *} 

*=a 

the term U(y v . . ., y n ) being omitted since it is a constant with 
respect to x r Here as before a bar over a symbol denotes 
evaluation at equilibrium. Similarly, 

(81) g v ...,y n maximizes Ufa,..., 2/J+2&C 1 — (1— 

i= 1 

Equations (80) and (81) suggest a degree of decentralization. 
Each process manager chooses an x 3 so as to maximize (80), the 
helmsman chooses final demands so as to maximize (81). Each 
is now operating on imperfectly competitive markets. Each now 
needs somewhat more information from the outside than in the 
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concave case. The manager of the j th process needs to know 
not only the p/s but also the h if ’s ( i = 1, . . ., s); the helmsman 
needs to know both p t and Ti t (i = 1, . . ., n). 

The dynamic counterpart of Theorem 14 is derived from 
Theorem 6. If we apply the theorem to the resource allocation, 
we find by straightforward differentiation that, 

(82) L* i = U Vi -(l+t li )p i (l-gi) v1 > 

(83) L*=i (l+vMl-g*)"gr 

i-1 

The gradient method based on these derivatives requires the 
helmsman and the process managers to find the unconstrained 
maxima of (81) and (80) respectively, with equilibrium replaced 
by current values. The information required is greater than in 
the concave case of Theorems 10 and 11, but not too much 
greater. The helmsman and each process manager must now 
know the current values of both the revenue parameter and the 
excess supply for every market, in addition to his utility function 
or technology, respectively. 

The behavior of the revenue parameters is a straightforward 
dynamic analogue of (76). 

Theorem 15. Under certain regularity conditions and for 
jj/s sufficiently large, the system of differential equations. 


(a) 

(b) 


ijf = (0 if L* ( < 0 and y t 
\jL* otherwise, 

, ,, _ (0 if L* < 0 and 
' \L* otherwise; 


0 , 

0 , 


(0 if [1 - gi(x 1 , . . ., x m , y v . . ., y„)] 1+, ‘<l 
(c) d-pijdt = I and = 0, 

l[l— gi(*i» • • •. Vv • • •> y n )] 1+Vi — 1 otherwise; 


converges to a local optimum of allocation. Here £* and L* 
are defined by (82) and (83), respectively. 

Theorem 15 thus supplies a dynamic system which converges 
locally to an optimum and which makes very limited infor- 
mational demands upon its participants. 

Theorem 4 leads to a static characterization of an optimal 
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allocation similar to that of Theorem 14. The new feature is the 
existence of a set of variables, the w/s, which, in equilibrium, 
represent the permissible excess supply. If we refer back to the 
reasoning leading to Theorem 4, we see that the w/s were 
introduced to convert inequality constraints into equality 
constraints. At equilibrium, then equality must hold in the 
constraints g { [x l9 . . x m> y v . . ., y n ) — w t = 0. 

(84) g { = w t ; if g { > 0, then § t = 0. 

The second part of (84) is intuitively obvious and can be proved 
rigorously. 

In the saddle-point characterization supplied by Theorem 4, 
the concavified Lagrangian with quadratic modification has as 
maximizing variables the w ? s as well as the x's and the y/ s. 
However, if we follow the argument given above about the 
decentralization implicit in any maximization we may say that 
the helmsman and the process managers maximize the La- 
grangian with respect to the variables under their control, 
taking as given both the p/s and the w/s. We may refer to the 
w/s as the disposal parameters . Then, given both the revenue 
and the disposal parameters, the helmsman and the process, 
managers have, just as in Theorem 14, to maximize a nonlinear 
function of the inputs and outputs. Again they can be thought 
of as seeking to maximize profits and utilities by activities which 
include trading on imperfect markets. 

Theorem 16. Under certain regularity conditions and for A 
sufficiently large, a necessary and sufficient condition for a. 
locally optimal allocation is that, 

(a) y v . . y n> x v . . ., x m maximize the expression, 

y») + 2 \$t{gi - - Hg { - 

i—1 

(b) g { = 0; if w { > 0, then pi = 0. 

The dynamic counterpart of Theorem 16 derives from ap- 
plication of Theorem 6 to the resource allocation problem. The* 
differeiftiations involved are straightforward. 

Theorem 17. Under certain regularity conditions and for A. 
sufficiently large, the system of differential equations, 
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(a) dyjdt 

(b) dxjdt 

(c) dwjdt 


jO if Ll t < 0 and y t = 0, 

\L\ i otherwise, 

|0 if Ll f < 0 and x } = 0, 

| L\ otherwise, 

|0 if p ( — 2 — w t ) > 0 and w t = 0, 
pi + 2A(g i — w t ) otherwise, 


(d) dfijdt = 


0 if gi > Wi and p t = 0, 
w t — gi otherwise, 


converges to a local optimum of allocation. In (a) and (b), 
respectively, 

(e) Ll t = U Vi - Pi + 2 X(g { - w^, 

(f) Vi-Mb-vM- 

i = i 

The adjustment formulas for the individual participants are 
comparatively simple. The helmsman and each process manager 
have to know, in addition to their own utility function or 
technology, the revenue parameter, p t , the disposal parameter, 
Wt, and the excess supply, g t , for each commodity. The custodian 
for commodity i has to know g t and he can then determine 
both the revenue and the disposal parameters from the pair of 
differential equations (c) and (d). 


E. Optimal Allocation Through Nonlinear Price Adjustment. 

If we examine (82) and (83), we see that there is a common 
expression in all of them, for which we may introduce a symbol, 

(85) q t = (1 + rjMl - &)". 

If we now refer to (a) and (b) of Theorem 15, we see that the 
helmsman or the process managers can respectively be thought 
of as seeking to maximize the difference between utility and 
costs or profits, if the q> s are now regarded as prices. These 
prices are now determined by a fairly complicated adjustment 
pattern; first an auxiliary variable ^ is determined by equation 
(c) of Theorem 15 and then q t is calculated from (85). However, 
the informational requirements are low, since the custodian, who 
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is charged with these operations, need still know at any one 
time only the excess supply g f . 

Theorem 18. Under certain regularity conditions and for 
Hi s sufficiently large, the dynamic system, 


(a) dyjdt 


U yt < q t and y t = 0, 
q { otherwise, 


(b) dx. 


_ jO if L 

~\u V{ - 

, fO if dnjdxj < 0 and x s = 0, 
— \dn 3 jdx s otherwise. 


(c) q t = (1 + vM 1 ~ &) 7 ‘> 


(d) dfijdt = { 


0 if gi > 0 and = 0, 

(1 “ g*) 1 * 57 *' ~ 1 otherwise, 


converges to a local optimum of allocation. In (b), 


(e) n^) =J,q i g ij (x j ). 

i—l 

In Theorem 18, the informational requirements on each 
participant are no different than in the concave case of Theorem 
10. The only elements of centralization are that a prior decision 
must be made on the constants rji and that a global optimum 
can only be assured if the initial approximation is sufficiently 
close. 

A limiting case of some interest arises if the adjustment speed 
of the helmsman and the process managers are increased in- 
definitely. In that case, for the helmsman we must have U V{ = q { 
unless yi = 0 and similarly dnjdx^ = 0 unless x j = 0. 63 ) Then 
we return to the supply-and-demand model briefly discussed in 
section B. But the price-adjustment rules of (c) and (d) of 
Theorem 18 now insure stability. 

Theorem 19. Theorem 18 remains valid if (a) and (b), are 
replaced by. 


6S ) It should be made clear that in this process x s is not necessarily chosen 
to maximize process profits, taking the q/s as given prices. The chosen x s does 
maximize the expression (80), with and h t3 being replaced by their current 
values, pi and h tJ ; with the notation of (85), the statement in the text follows. 
Thus, as i-ange correctly noted, the rule that marginal profitability be zero 
replaces the rule of profit maximization for increasing returns; however, the 
above analysis shows that the price-adjustment rule which leads to a stable 
allocative system is not a simple one. 



102 


ESSAYS IN ECONOMICS AND ECONOMETRICS 


(a') U V( ^ q { ; if U y> < q it then y t = 0; 

(b') d7tjdx } ^ 0; if dizjdx, < 0, then x } — 0. 


F. Optimal Allocation Through Price Speculation . 

An alternative institutional interpretation can be given of 
Theorem 17, in a manner somewhat similar to that which led 
from Theorem 15 to Theorem 18. If we look at (e) and (f) of 
Theorem 17, we find it natural to introduce the definition, 


(86) fi = Pi + 2AK - g { ). 


If we now take account of (d) of Theorem 17, we see that, unless 
both g { > and p t — 0, 

(87) ft = Pi + 2 X{dpjdt), 


that is, p{ can be considered as an expectation of the price of 
commodity i in the near future formed by extrapolation of 
current rates of change. 64 ) We will now argue that (87) can be 
made to hold without the exception noted above. As noted in 
the Remark following Theorem 4, the saddle-point properties 
of the concavified Lagrangian with quadratic modification hold 
whether the p t ’s are considered to be non-negative or unrestricted 
as to sign. Relation (d) of Theorem 17 is the appropriate form 
if the p { ’s are restricted to be non-negative. If they are taken 
to be unrestricted as to sign, then only the second line is relevant; 
Theorem 17 remains true with this alteration. Then (87) is true 
in general. 

Theorem 20. Under certain regularity conditions and for A 
sufficiently large, the system of differential equations, 


(a) dyjdt = j 


0 if U V( < p\ and y t = 0, 
Uy t — Pi otherwise. 


64 ) Extrapolative expectations of the type of equation (87) have been studied 
in connection with inventories by L. Metzler, “The Nature and Stability of 
Inventory Cycles," Review of Economic Statistics, XXIII (1941), 113 — 29, 
and in connection with inflationary price movements by A. C. Enthoven, “Mone- 
tary Disequilibria and the Dynamics of Inflation,” Economic Journal, Vol. 66 
(1956), 256 — 70; see also, A. C. Enthoven and K. Arrow, “A Theorem on Ex- 
pectations and the Stability of Equilibrium,” Econometrica, Vol. 24 (1956) 
288 — 293. More detailed study of the system studied in this section will be 
found in Arrow and Solow, op. cit., sec. 4. 
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(b) dxjdt = 


0 if djdJdXj < 0 and x, = 0, 
dn'/dXj otherwise, 


(c) dwjdt = 


|0 if pi > 0 and w { = 0, 
\—p{ otherwise, 


(d) dpjdt = w t — g„ 

(e) fi'< = p t + 2X(dj>Jdt). 

converges to a local optimum of allocation. In (b), 

(f) n}(x 3 ) = 2 #£«(*#)• 

i=l 

Relations (a) and (b) show that the helmsman and the process 
managers are moving in the direction of maximum benefits 
computed at expected prices, rather than current ones. Equa- 
tions (c — e) can be given various institutional interpretations; 
one is that, taken together, they instruct the custodian how to 
determine the expected price to be announced to the other 
participants. It is required that the expected price be sufficiently 
sensitive to the current rate of change. 65 ) 


G. The Linear Case. 

We return finally to the case which started the entire in- 
vestigation, that where the utility function and all the processes 
are linear, that is, the case of linear programming. In this case, 
as we have seen in part II, section D, the gradient method will 
normally lead to endless oscillations. Since stability has been 
shown to be related to the strict concavity of the Lagrangian, 
it is natural to try one of the concavification methods introduced 
in part II, section C. Since the problem is linear, the Lagrangian 
is, so to speak, on the borderline of strict concavity, and it 
might be expected that any modification of the Lagrangian 
which increases its concavity would suffice. This expectation 
is correct, at least as far as the method of concavification by 
transformed constraints. 


65 ) The system of Theorem 20 and its interpretation in terms of price ex- 
pectations is similar but not identical to the proposal of T. Kose, "Solutions 
of Saddle Value Problems by Differential Equations/’ Econometnca , Yol. 24 
(1956), 59—70. 
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Theorem 21. If U {y v . . y n ) and the process functions 
gijixj) are all linear, then the system of differential equations 
of Theorem 15, with any rj ( > 0 converges to a global optimum 
of allocation. Equivalently, the dynamic systems of Theorems 
18 and 19 converge to a global optimum under the same 
condition. 66 ) 

The method of Theorem 21 is by no means the only gradient 
method for the solution of linear programming problems. Because 
of their equivalence to zero-sum two-person games, 67 ) the 
modified gradient methods developed by Brown and von Neu- 
man for solving the latter can be applied to the former. 68 ) Of 
course, there are other methods, of which the best known is the 
simplex method, which are essentially different from the gradient 
methods. Although there has been as yet insufficient com- 
putational experience, it is very likely that the simplex method 
is a superior method for computation to any variety of the 
gradient method. But from the viewpoint of the present study, 
it lacks the very important virtue of decentralization. 


6S ) For a proof of Theorem 21, see Arrow and Hurwicz, “Gradient Method 
for Resource Allocation,” op. cit ., Theorem 3. An illustration of this method 
with remarks on the problems of machine calculation is found in T. Marschak, 
“An Example of a Modified Gradient Method for Linear Programming,” ch. 9 
in Arrow, Hurwicz, and Uzawa, op. cit. 

67 ) See footnole 37. 

® 8 ) See G. W. Brown and J. von Neumann, “Solutions of Games by Dif- 
ferential Equations,” in H. W. Kuhn and A. W. Tucker, eds.. Contributions 
to the Theory of Games I, Annals of Mathematics Study No. 29 (Princeton, 
New Jersey: Princeton University Press, 1950), pp. 73 — 80. For the application 
to linear programming, see M. Fukuoka, “A note on Convergence in Linear 
Programming Problems,” Cowles Commission Discussion Paper: Economics 
No. 2108, July 14, 1954. 
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On the Theory of Utility and Demand 
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External Economies in Consumption 1 . 


The importance of external economies in production is well 
recognized. Recent literature has treated the theory of external 
economies in consumption. 2 ) This may have some importance 
for welfare economics. 3 ) It is one way of getting away from the 
unrealistic assumption of a society consisting of atomistic in- 
dividuals, who do not influence each other. Such a procedure 
has the advantage of simplicity, but as Aristotle remarked, man 
is a political animal (Politics, 1.29 1252b). 

We assume a society consisting of K individuals. Let u {k) 
be the utility function or utility index of the individual k 
(k = 1, 2 ... K). Let x kn be the amount of the commodity or 
service n consumed by the individual k (n = 1 , 2 . . . N). We 
assume that the money income M k of the individual is given, as 
well as the prices (n = 1, 2 . . . N) of all commodities and 
services. We denote by: 

(1) *. = !**. (n=l,2...N) 

km 1 

the total amount of commodity or service n demanded. We 
neglect the possibility of coalitions and other complications 
arising in the theory of games. 

x ) I am much, obliged to Dr. R. L. Basman (Hanford, Wash.) and to M. 
Wimbee (Paris, France) for advice and criticism of this paper. Journal Paper 
No. J-3766 of the Iowa Agricultural and Home Economics Experiment Station, 
Ames Iowa. Project No. 1200. 

2 ) J. S. Duesenberry, Income, Savings and the Theory of Consumer’s Behavior . 
(Cambridge, Mass.: Harvard University Press, 1949), H. G. Johnson, “The 
Effects of Income Redistribution on Aggregate Consumption with Interdepend- 
ence of Consumer's Preferences," Economica , vol. 19 (1952), 131 ff. J. S. 
Prais aad H. S. Houttakkei, The Analysis of Family Budgets (Cambridge: 
Cambridge University Press, 1955). 

3 ) G. Tintner, “A Note on Welfare Economics," Econometrica, vol. 14 (1946), 
69 ff. 
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Following Allen, 4 ) we denote by: 

(2) Ef/Ez=(df/dz)(zlf) 

the elasticity of a function / with respect to a variable z. 

We present first the classical atomic theory. 6 ) Let 

(3) u^ k) = u^(x kl , x k2 , . . . x kN ) 

be the utility function of the individual k, which depends only 
upon the quantities of commodities and services possessed by 
himself. The function (3) has to be maximized under the budget 
restriction: 

W ^X kn p n = M k . 

71=1 

Now we define: 


(5) du m ldx kr = u$; d* u™\dx kr dx ks = ks 


and the matrix with N + 1 rows and columns: 


( 6 ) 


XJ m 


__ ' 0 
= -ul 


<») 

ks 


< 1 
^kr, fcH 


(r, s= 1,2... N). 


Let U$ be the cofactor of u ( k J in the inverse matrix 
and ks the cofactor of ks in the same matrix. Then we 
have the well known results: 


(7) dxJdM k = L k UW 

(8) dx k Jdp r — -L k U$x kr +L k UW ks = -x kr {dxJdM k ) +L k U kr>kt . 

In these formulae L k is a Lagrange multiplier, the marginal 
utility of money for individual k. We have for superior com- 
modities: 


(10) dxJdM„ > 0 
and for inferior commodities: 

(11) dxJdM k < 0. 

L k U$ ks is the substitution term. If 

( 12 ) ’ U™ k3 < 0 

4 ) R. G. D. Allen, Mathematical Analysis for Economists (London: Macmillan, 
1949), pp. 251 ff. 

6 ) J. R. Hicks, Value and Capital (2nd ed., Oxford: Clarendon Press? 1 1956). 
P. A. Samuelson, Foundations of Economic Analysis , (Cambridge, Mass.: 
Harvard University Press, 1947, H. Hotelling, ‘‘Demand Functions with 
Limited Budgets,’* Econometrica, vol. 3 (1935) 66 ff. 
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then the commodities r and s are substitutes. If 

(13) tfJJU > 0 

then r and s are complementary commodities or services. 

We see from formula (8) that we may have dx k Jdp a > 0, in 
spite of the fact that U { ka \ ka < 0. This is the Giffen phenomenon 
which occurs for inferior goods. 

Now we define: 

(14) m kr = p r x kr /M k 

the proportion of the expenditure of individual k on commodity 
r in relation to his total expenditure or income. Also the partial 
elasticity of substitution: 

(15) ks = (2 X kn U kn ) ^kr, kal i. X kr * x ks ) 

71=1 

The elasticity with respect to price is now given by: 

( 16 ) ExJEp r = m kr {o krM - Ex JEM k ). 

For the total demand in the market for commodity s we have: 

(17) dXJdM k = dxJdM k 

( 18 ) dXJdp r = 2 fcjdpr 

fc=l 

and for the elasticities: 

( 19 ) EXJEM k = ExJEM k 

( 20 ) EXJEp, = | (ExJEp r ) (xJX s ). 

fc = 1 

The price elasticity of demand for the total demand in the 
market is the weighted sum of the individual demand elasticities, 
the weights are the proportions x k8 /x s of the quantity demanded 
by individual k in relation to the total demand for the com- 
modity in question. 

Now we assume that the utility index depends on all the 
incomes in the society: 

( 21 ) V W = ^ k) {x kl y X k 2 • • • %kN> M ^ . • ■ Mr) 

This is perhaps a somewhat realistic assumption, since the tastes 
of individuals depend upon the social class of the individuals 
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and social standing is to some extent depending upon income, 
at least in America. 6 ) 

We define: 

( 22 ) dv >w/dM t = v§. 

Now we have: 

( 23 ) dx k JdM t — L k U)® • 8 kt ^ Ukl, kn V kn(t) • 

71=1 

In this formula d kt denotes the Kronecker delta S kt = 1 if k — t, 
8 kt = 0 otherwise. 

The income elasticity appears now as: 

( 24 ) Ex JEM t = SJExJEM k ) 0 (MJM k ) 

-l(Ev^lEM t )m kn a^ kn . 

71=1 

In this formula ( ExJEM k ) 0 is the income elasticity as defined 
in formula (11) for the atomic case. Let us put t — k, i.e. we 
consider the elasticity with respect to the income of the individual 
k in question: 

( 25 ) Ex JEM k = (ExJEM k ) 0 - 1 (Ev^/EM k )a^ kn . 

71=1 

We see from this formula, that income elasticities and elasticities 
of substitution are no longer independent. Because of the 
dependence of the utility function on all incomes, we may have 
in ( 25 ) ExJEM k < 0 even if the atomistic income elasticity 
(ExJEM k ) 0 is positive, i.e. we have a superior good in the 
classical sense. This is due to the sum in formula ( 25 ), which 
may be large and positive. 

Similarly we have for the total demand: 

( 26 ) EX JEM (Ex JEM t ) 0 -f f (Ev^/EM t )o^ kn . 

fc=l 71=1 

Now we consider the most general case in which the utility 
of each individual depends upon the consumption of all individuals 
in the given society. We may think of the Veblen effect and 
similar phenomena. 7 ) 

6 ) T. Parsons and N. J. Smelser, Economy and Society (London: Routledge* 
1956). 

7 ) T. Veblen, Theory of the Leisure Class (New York: Modern Library, 1945). 
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(27) = w w (x n , x 12 . . . x lN , x 2V 

X 22 , • . . *2IV ■ • • X K1> X K2 • • • x Kn)- 

We define a compound matrix, i.e. a matrix consisting of 
matrices: 


(28) W=[AM], (i, j= 1,2... if), 

where the individual matrices are defined as follows: 

T 0 <5 

LO ij- “'is “'ir, is J 

The inverse of the matrix W (28) is again a compound matrix: 

(30) W* 1 = [£«*>]. 

Now we obtain: 

(31) dx ks /dM t = B™-L t . 


The income elasticity is: 

(32) ExJEM t = BW ■ L t • M t /x k ,. 


We obtain also: 

(33) dx k Jdp T = - 1 * <r L, + | L, 

t=l i»l 

and for the price elasticity: 

(34) ExJEp r = J'-m ir {ExJEM i )+m iT o l g ) iis • (xjxj. 

<= 1 

In formula (34) we denote by a^ it the partial elasticity of 
individual k for the substitution of commodity r for commodity s 
by the individual i. 

For the market demand, we obtain the income elasticity: 

(35) EXJEM t = | {Ex JEM t ) 

s= 1 


and for the price elasticity: 

K K 

(36) EXJEp, = 2 1 -t»* {Ex JEM,) +m ir (xjxj . 

k—1 t*«l 

The methods used by Wald to derive empirical indifference 
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systems 8 ) and the recent studies based upon household budget 
surveys 9 ) give us hope that an empirical, econometric verifica- 
tion of the theory may be possible. 


8 ) A. Wald, “The Approximate Determination of Indifference Systems by 
Means of Engel Curves/’ Econometrica , vol. 8 (1940) 144 ff. See also the results 
of J. A. Nordin in G. Tintner, Econometrics (New York: Wiley, 1952), pp. 60 ff. 

9 ) R. Stone, D. A. Rowe, W. J. Corlett, R. Hurstfield, and M. Potter, The 
Measurement of Consumer's Expenditure and Behaviour in the United Kingdom , 
1920 — 38 (Cambridge: Cambridge University Press, 1954) H. Wold and L. 
Jurden, Demand Analysis (New York: Wiley, 1953). J. S. Prais and H. S. Hout- 
hakker, op. cit. 



RALPH W. PFOUTS 


Hours of Work, Savings and the Utility 
Function 


The contributions of Harold Hotelling to the theory of 
utility and demand place him in a small group of pioneers that 
altered the fundamentals of economic theory. His work, coming 
at about the same time as that of Hicks, Allen and Schultz, 
and coinciding approximately with the discovery of Slutsky’s 
work, helped to lay to rest older concepts of measurable utility, 
to unite utility and demand theory and to rid the theory of 
unnecessary ceteris paribus assumptions. Through these advances 
a coherent theory of utility and demand was provided for the 
segments of economics that must rely on utility and demand 
theory. 

The purpose of this paper is to present a modest extension 
of this utility and demand theory that has become the present- 
day orthodoxy. Specifically, the suggestion will be made that 
hours of work and savings are variables affecting psychological 
values and are deserving of consideration to the same extent 
as the quantities of the various goods consumed. 

I 


The Assumptions 

If hours of work and savings are inserted into the utility 
function, several additional results are achieved that cannot 
be deduced in the case in which only the quantities of the 
goods appear as independent variables. In the case that in- 
cludes hours of work and savings in the utility function, a more 
elaborate and more realistic prototype of the consumer is 
available, and monetary considerations can be brought into 
the pattern of consumer behavior. This suggests that the more 
elaborate case may yield a consumer element suitable for 
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aggregation into marginal propensity to save or marginal pro- 
pensity to consumer schedules, thus bringing micro-theories 
and macro-theories into closer alignment. Furthermore, new 
demand restrictions, in addition to modification of known 
restrictions, are obtained from this formulation. 

In this paper we accept without extensive justification the 
desirability of including hours of work and savings in the utility 
function. In any case, this seems unnecessary because discussions 
of demand for leisure can be found in economic literature dating 
from the time of the classical economists. 1 ) It would seem clear 
and acceptable that hours of work affect both the individual’s 
level of satisfaction and his income. 

Similarly there seems to be little need for justifying the 
inclusion of savings in the utility function. If savings were not 
actively desired by individuals, persons in the lower and middle 
income groups would spend their money on goods and services 
entirely, and savings would be accumulated only by those with 
such large incomes that they could not spend the entire amounts 
on consumers’ goods and services. As Lord Keynes put it, an 
act of saving by an individual “means — so to speak — a 
decision not to have dinner today.” 2 ) 

Thus we write the utility function as 

(I) U = &{x v . . ., x n , y, s), 

where the a;’s indicate quantities of goods and services, y 
represents hours of work and $ stands for current savings in 
dollars. We assume that 


x ) Among the more noted and more recent writings on this subject must be 
listed: A. C. Pigou, The Economics of Welfare (London, 1929), p. 573, and 
A Study in Public Finance, (London, 1928), pp. 83 — 84. F. H. Knight, Risk, 
Uncertainty and Profit (New York and Boston, 1921), p. 117. L. Robbins, 
“On the Elasticity of Demand for Income in Terms of Effort,” Economica, X 
(1930), 123 — 29, reprinted in A. E. A. Readings in the Theory of Income Distribu- 
tion, W. Fellner and B. F. Haley, eds. (Philadelphia and Toronto, 1946), pp. 
237—44. F. W. Paish, “Economic Incentive in Wartime,” Economica, N. S. 
VIII (1941), 239 — 48. G. Cooper, “Taxation and Incentive in Mobilization,” 
Quarterly Journal of Economics, LXVI (1952), 43 — 66. ® 

2 ) Keynes, The General Theory of Employment, Interest and Money (London 
and New York, 1936), pp. 19, 21, 64 — 65, 210 — 13. 
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dU 

dXf 


= ®i> 


0 , 


dU 

dy 


= 0 V <O, 


dU 

ds 


= & s >0. 


In other words goods and savings have utility; work, at least 
after a certain time, has disutility. 

Since current savings are introduced into the utility function, 
a stock of earning assets must be considered in the budget 
equation as a source of income. In addition, since hours of work 
are included in the utility function, it is appropriate to include 
the wage rate in the budget equation. As a consequence the 
budget equation is 

(2) 2 Pi x i + s = wy + rM. 

i 

In this equation the p's are the prices of the goods and services, 
w is the wage rate, M is the stock of earning assets and r is the 
rate of return. The p's, w and r are assumed to be mutually in- 
dependent variables. Clearly M includes the accumulation of 
savings from past periods,, but it may also include windfalls. 3 ) 

There are advantages to be obtained from writing the utility 
equation in the form of (1) and the budget equation in the form 
of (2) that we will note only briefly. In the first place these 
equations specifically include hours of work and consequently 
should be readily usable in welfare economics. That is, an in- 
dividual's economic welfare depends not only the income he 
receives, or the goods and services he purchases, but also on the 
hours of work he must render to obtain the income, or the 
goods and services. Through introducing hours of work directly 
into the equations that determine the consumer's equilibrium 


s ) By using rM in (2) we implicitly assume either that the individual puts 
all his earning assets in uses showing the same rate of return or that r is an 
average mte of return. It would be more realistic to write £ r t M t in place of rM 
to show a differentiated portfolio. This would get into complications that it 
seems desirable to avoid for purposes of the present paper i.e., we avoid all 
problems of expectations, principles of portfolio selection, etc. 
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we introduce directly an important component of welfare that 
previously was ignored or introduced by indirection. 

The usual budget equation can be modified to include hours 
of work by treating work as a negative good. The budget restric- 
tion then is 

2 Pi X i ~ 0* 

i 

The disadvantage of this equation, beside appearing unnatural, 
is that there is no fixed income exhibited as in the common form 
of the budget equation and there is no term corresponding to 
rM in (2). As a result the income effect cannot be readily identi- 
fied if resort is made the treating work as a negative good. Hence 
the advantage of introducing hours of work is offset by obscuring 
the income effect. This difficulty does not arise in the formulation 
advanced below. 


The Equilibrium 

Then to find the consumer's equilibrium position under the 
assumptions outlined above, we form the equation 


(3) A = 0(x it . . ., x„, y, s) + A( 2 Pi%i + s — wy — rM). 

i 

This yields equations that, together with (2), describe the 
maximum satisfaction position for the consumer: 


(4) 


BA 

Bx t 


= $< + Ipi = 0 , 


— = 0 y — hsi = 0, 
dy 


BA 

ds 


= 0 a + X = 0 . 


{i = 1, . . ., n). 


Equations (4) and equation (2) are clearly analogous to the 
■orthodox method of determining the consumer’s equilibrium 
position. 

II 


The Effects of Price Changes 

As would be expected from recalling utility theory that does 
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not include hours of work and savings, the most interesting 
points arise from consideration of displacements from the 
equilibrium condition. Thus a change in the price of the kth 
good leads to the equations 

_ dx, dy ds dX 

( 5 ) = ~ 

i Ofk °Pk tyk °P k 

where 8 ik is the Kronecker delta and i = 1 , . . ., n. 

In addition we have 

_ _ dx, dy _ ds dX 

+ ®vvTZ~ + w ~ °> 

i dp k dp k dp k dp k 

... _ , dx, dy _ ds dX 

(6) 2 ®si aT - + ®»vTT + TT + pT~ = °> 

i dp k dp k dp k dp k 

v , dxj dy ds 

+ = -**• 
j °Pic dp k dp k 

Then proceeding in the orthodox manner we make use of the 
determinant 


(?) 


D = 


<?u 

. . 

• • • 

*1, 

*u 

Pi 

0 , 

0 

. . . 

*nv 

0 

^ ns 

Pn 


. . . 

0 

^VV 

0 

^ vs 

— w 

<P* 

• • • ^sn 

0 

^av 

0 

1 

Pi 


—w 

1 

0 


and Cramer’s rule to obtain 


(8) 

dx { 



_ x 
x D x * 

(9) 

dy _ 

- Dk, n+1 

dp k ~ 

D 

(10) 

II 

5 &k,n+2 

1 D 


D 


n+3 ,i 


D 


D 


(*' = 1. 


— X k 


n+3, n+1 


D 


— x. 


D 


n+3, n+2 

~D 


In (8), (9) and (10) the subscripts to D indicate cofactors. The 
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last three equations are clearly comparable to the Slutsky-Hicks 
equation. 4 They show the effect of a price change on the 
quantities of the goods purchased by the individual, the hours 
of work the individual wishes to offer and the savings that he 
wants to lay by. In each case the effect of the price change is 
divided into two parts, again obviously comparable to the 
substitution and income effects of the Slutsky-Hicks equation. 

To continue the parallel with the Slutsky-Hicks equation, we 
examine the income effects directly. Under the assumptions 
that we are using, there is no single income effect, but rather 
income effects arising from either a change in the rate of return 
or a change in the stock of earning assets. Thus if we differentiate 
equations (4) and equation (2) with respect to r, the rate of 
return, we get 


( 11 ) 


___ dx* dy ds dX 

+ + (*-i. ») 


dx t 

dr 


2 ®vl + ®vv a „, + ®vs ^ 


dy_ 

'dr 


ds 


w 


dX 

dr 


0, 


dx } t ^ dy . „ ds dX 


dr dr ' dr 


Solving (11) yields 

(12) 


_ nj Dn+Z.i 

dr D ’ 


( i —l,..., n), 


(13) 


ty _ ^ ^»+3,«+l 
dr D 


(14) 


71 r ^n+ 3 , ?i+2 

dr~ D 


Then substituting from (12) to (8), from (13) to (9) and from 
(14) to (10) gives us 


(15) 


ftp* x D u x h dXj 
df k D M dr’ 


(* — l*ff • •> n ) 


4 ) See, for example, J. R. Hicks, Value and Capital (2nd ed., Oxford, 1946) 
pp. 306—9. 
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(16) 

(17) 


dy _ 

>} ^k,n+ 1 

%k dy 

1 

1 * 
bg. 

A D 

M dr 

ds 

, ^ft,n+ 2 

x k ds 


1 D 

M dr 


In equations (15), (16) and (17) the effects of a price change 
on the quantity of a good, the hours of work and savings are 
divided into a substitution effect and an income effect, the 
income effect resulting from a change in the rate of return. We 
can obtain equations similar to (15), (16) and (17) by considering 
the effects of a change in M, the stock of earning assets. To do 
this we differentiate equations (4) and equation (2) with respect 
to M obtaining 


_ dxt dy ds d). 

^ 0ii m + 0iv m + 0t 'dM +:pi dM =0 ’ {i=h • ' • ' " ' n) 


(18) 


2 aiu- + 

3 


, ds dX 

+ ^VS r W -^77 ~ 


dM ' ~ vv dM ' ‘ va dM ~dM 


_ ^ dx< ^ dy ^ ds dX 
? + sv dM + 0 *‘dM + dM = °’ 

dx, dy ds _ 
dM W dM + dM 


Solving equations (18) yields 


(19) 


_ -P|i+3,i 

dM ~ D ’ 


(i n), 


(20) 


_ ^w+3, n+1 

dM D ’ 


(21) 


ds 

dM 


D 


= r- 


w+3, ti+2 

~D 


Then substituting from (19) to (8), from (20) to (9) and from 
(21) to. (10) shows 


dX( _ D ki x k dx t 
dp k Dr dM ’ 


(22) 
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(23) 

(24) 


jty = _ 2. Dk - n+1 Xk dy 

dp k D r dM * 

— — Xk - 

d-p k D r dM 


Inspection of equations (22), (23) and (24) again shows the 
effect of a price change on the quantity of a good, hours of 
work, and saving divided into a substitution effect and an in- 
come effect, the income effect now being related to a change in 
the stock of earning assets. Again the similarity to the Slutsky- 
Hicks equation is obvious. 


The Effects of a Wage Rate Change 

A displacement from equilibrium due to a change in the 
wage rate also shows a compound effect similar to that of the 
Slutsky-Hicks equation. To approach this we find the partial 
derivatives of equations (4) and equation (2) with respect to 
the wage rate, obtaining 


dx, ^ dy _ ds dX .. 


(25) 


dx s 
' dw 


dy 


+ + 0 vs— — w~ = X, 


ds dX 

. w — 

dw dw 

ds dX 


y0 _i_ 0 = o 

j ei dw sy dw **dw dw 

dx , dy ds _ 

dw W dw + dw V ' 

Solving equations (25) shows us that 

^ 71+3, i 


(26) 

(27) 

(28) 


^ X i ,Dn+l,t | ^Pn 


y ^ 71 + 1 , tt-rl 

dw D 


+ : 


D 


ds 

dw 


X 77+2 


+ y- 


71+3, 71+1 

~D~‘ 


71+3, 72+2 


D 


{i =l,...,n) 
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The similarity of equations (26), (27) and (28) to the Slutsky- 
Hicks equation is obvious and, as in the case of a response to a 
price change, we have a choice of two income effects. Thus we 
can substitute from (12) to (26), from (13) to (27) and from 
(14) to (28) obtaining 


(29) 

dx { D, 

(30) 

dy 
d w 

(31) 

ds A 

dw - 


71+1, t 

D 


<y_ d J0i 

^ M dr’ 


72+1, n-t-1 


y dy 


D 


72+1, n+2 


M dr 
y ds 


D 


M dr 


{i = 1, . . ., n). 


Alternatively we can substitute from (19) to (26), from (20) 
to (27) and from (21) to (28); this yields 



dx £ _ 

-5 ^n+l t i | 

y_ 

dx t 

\y*j 

dw 

D 1 

r 

dM’ 

/QO\ 

dy _ 

1 ^n+1, n+1 

I 

y_ 

dy 

\ 66 ) 

dw 

D 


r 

dM’ 


ds 

1 -^72+1, 72 +2 

j_ 

y_ 

ds 


dw 

D 


r 

dM' 


In these latter equations the income effect shows the result of 
a change in the stock of earning assets. 


Restrictions on Consumer Behavior 

For many years critics of utility and demand theory have 
argued that this theory was too restrictive to constitute a 
reasonable description of human behavior. Basically the ar- 
gument contends that, in the theory, the path of a consumer’s 
behavior is preordained and that he responds too mechanically 
to changes in prices and income; that too few allowances for 
individual deviations are made; in short that he is Veblen's 
“oscillating globule of desire.” In considering the validity of 
this criticism it is instructive to consider the changes in quantities 
of goods, hours of work and savings as developed in this paper. 
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It is quite literally true that in no case developed in this 
paper can we be certain of the direction of change of quantities 
of goods, hours of work or savings in response to a change in 
prices, wage rate, rate of return or stock of earning assets. 
That is, on the basis of logic or theory alone, we cannot be 
certain that a change in any dependent variable with respect 
to any independent variable is invariably positive or negative. 
For such conclusions we must resort to empirical observation. 
Surely this is not an overly restrictive representation of consumer 
behavior. 

Indeed in only two of the cases considered does the mathe- 
matics show any indication of the direction of change. One of 
these is the well known instance: 

5 1 

dp~ D r m' 

i.e . the case of the response of the quantity of a good to a change 
in its own price. In this equation the first term of the righthand 
member must be negative. This is true because the principle 
minors of the determinant in (7) oscillate in sign, hence the ratio 
of cofactor to determinant must be negative, while —A must be 
positive. But this is not sufficient to insure a negative response 
of the good to a change in its own price because the second term 
of the righthand member could be positive and larger in absolute 
amount than the first term. 

The second instance in which the mathematics gives some 
indication of the direction of change is the case of the response 
of hours of work to change in the wage rate. The equation in 
this case is of the type 

. , -Pn+i,>H-i , y 

dw D "r dM' 

In this equation the first term of the righthand member is 
obviously positive, but again this is not sufficient to assure 
that hours of work increase in response to a change in wage 
rate because the income term can be sufficiently negative to 
offset the other term. Indeed, since the income term represents 
a change in unearned income, it seems quite probable on empirical 
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grounds that the income term will be negative and quite large. 
Consequently we cannot be certain of the direction of change in 
this case either, or, indeed, in any case without reference to 
empirical observation. 


Ill 

Some Characteristics of the Demand Equation 

The equilibrium conditions, equations (4) and (2), represent 
n + 3 conditions on the 2n + 6 variables, x v . . x n ; p lt . . ., p n ; 
y, s, w f r , M and X. Then assuming that there are no mathe- 
matical difficulties, it is possible to solve for n + 3 of the 
variables in terms of the remainder. 5 ) It is of special interest 
to view these solutions as demand equations written as 


(35) 

X i K ( Pl> * ’ *> Pn* 

M, r, w), 

(i = 

(36) 

II 

;pn. M, r, 

w), 

(37) 

s = h(Pl> ■ ■ 

;Pn, M, r, 

w), 

(38) 

1 

1! 

3 s " 

H* 

• •> Pn> 7 

w). 


In this form the equations show as dependent variables the 
quantities that are actually subject to control by the individual. 

The demand equations that we have formulated are free of 
any particular index, i.e., it does not make any difference what 
utility index we use. Thus the situation with respect to the 
choice of utility indicator is the same as in the case in which 
savings and hours of work are not included in the utility function. 
The freedom from the utility index may be seen by rewriting 
equations (4) as 

(39) - X = T- = (*•=!. •••>")• 

pi W 

This shows, among other things, that the marginal utility of 
money is equal to the marginal utility of savings in the equili- 
brium position. From equations (39) we obtain 

5 ) See P. A. Samuelson, Foundations of Economic Analysis (Cambridge, 
Mass., 1947), pp. 99 — 100. 
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(40) 
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where F is a monotonic transformation. Equations (40) show 
that if a monotonic transformation of the utility function were 
made, it would have no effect on the equilibrium position. 6 ) 


The Absence of a Homogeneity Condition 

In the orthodox development of the theory of individual 
demand, the demand equations are homogeneous of order zero 
with respect to prices and income. 7 ) This arises from the absence 
of savings in the budget equation and from equilibrium conditions 
similar to the first equations of (40) . In the formulation developed 
in this paper, the individual demand equations are no longer 
homogeneous of order zero with respect to the independent 
variables. This may be seen from inspection of equation (2). 
Since savings have no price in the ordinary sense, a proportional 
change in the prices associated with other variables does not 
affect the term representing savings. Consequently the equality 
in (2) is not maintained. Similarly, from inspection of the last 
two equations in (40), it is clear that a proportional change 
in prices and wage rate will alter the equilibrium position. 
Consequently, the restriction that the demand equations be 
homogeneous of order zero is lost in the present formulation. 
It is interesting to notice that this restriction is preserved if 
hours of work are put into the utility equation and savings 
are not. 8 ) 

6 ) Ibid ., pp. 97 — 99. 

7 ) Ibid ., pp. 104—7. 

8 ) See F. Gilbert and R. W. Pfouts, "A Theory of the Responsiveness of 
Hours of Work to Changes in the Wage Rate/' Review of Economics and 
Statistics , XL (1958), pp. 116—21. 
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The point of the preceding paragraph can be developed more 
finely by considering the equation 


(41) 


dx. „ , dx, dx, dx, 

* dp k oM or dw 

k JLJ k 


D D 


+ r M^^ + w(^^^ + y^^y 


the substitutions being made from equations (8), (19), (12) 
and (26). If the homogeneity condition held in the present 
formulation, the left member of (41) would be identically zero 
by Euler’s theorem on homogeneous functions; we shall see 
that this is not true in general. If we add and substract 
A(D n+2 'i/D) to the right member of (41) and rearrange terms we 
obtain 

(42) - | £***^±M + + wy 


+ r M 


^n+S,i 

D 


+ A 


D 


The first three terms in expression (42) can be eliminated by 
reference to (7) and the theorem that expansion of a deter- 
minant by the elements of one column and the corresponding 
cofactors of another column yields zero. By reference to (2) it 
follows that the second three terms of (42) reduce to s(D n+Zti /D). 
Consequently we can write 


(43) 


— dx, mjr dXi dx , dx, 

Zfi' + M—t + r—' + w-l 
it dp k dM dr dw 


D 


= s- 


~D~ 


+ rM - 


D 


n+3,i 


+ A- 


D, 


n+2,i 


Substituting from (19) and (24) gives 
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(u] VA , 3 + ze>— ‘ = -^. 
» 3r 3® r dM 


x i ds 
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3s 

dfit' 


If (44) is transformed into elasticity form, we obtain 


( 45 ) IVik+Vir + Viw 
a 




\ rM ^ m ^ p iXi 


»?<,<)• 


While (44) and (46) lack the straight-forward simplicity of the 
homogeneity condition, they do bring out certain points. First 
it may be noted by reference to these equations that the homo- 
geneity condition clearly does not hold under the present as- 
sumptions; further it is difficult to visualize an assumption with 
intuitive appeal that would restore the homogeneity condition. 
Secondly, the influence of savings in upsetting the homo- 
geneity condition is clearly exhibited in the righthand members of 
these equations. Observation of the righthand member of (45) 
shows that if savings were zero, the homogeneity condition would 
almost be restored. But the full homogeneity condition would 
not be restored because the elasticity of the i th good with respect 
to the stock of earning assets would still be missing from the 
lefthand member of (46). 

A similar development can be made for equation (36), the 
equation expressing the demand of the individual for hours of 
work. In elasticity from the result is 


(46) ^ Vt/k ~f~ Vvt H“ Vyw S Vsm) + ' 

Reference to (46) shows that the homogeneity condition does 
not hold in general for (36). Again the importance of savings 
in removing the homogeneity condition with reference to the 
demand for hours of work is brought out in the righthand 
member of (46). 

In the same fashion equation (37) can be used to show the 
absence of the homogeneity condition in the demand for savings. 
When exhibited in elasticity form the result is 

(47) 2 Vsk + Var + Vsu — • 

* rM s D 


Again the homogeneity condition is clearly not satisfied. With. 
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regard to this righthand member of (47) it may be observed 
that the ratio of cofactor to determinant will be negative since 
principal minors of the determinant in (7) will alternate in 
sign; at the same time X will be negative. Consequently the sign 
of each member is the same as the sign of s. If savings are 
positive each member of (47) is positive, but if dissaving takes 
place each member is negative. 

Effects of Saving and Dissaving 

By making certain assumptions that appear to be reasonable 
on empirical grounds, we can obtain some additional results 
from (47). We assume that the elasticity of savings with respect 
to price is negative but that the elasticities of savings with 
respect to wage rate, rate of return and stock of earning assets 
are positive. It follows from (47) that if 

$ > 0 , 

then 

s 

(^®) V 8r H" Vst o VsM ^ 2 Vsk’ 

Under the assumptions about the signs of the elasticities, the 
first two terms in the lefthand member of (48) are both positive 
and the third term has a negative effect, while the right member 
is positive, consisting of a negative term prefixed by a minus 
sign. Then (48) implies that savings respond more heavily to 
changes in rate of return and in wage rate than they do to prices 
and to stock of earning assets weighted by savings as a propor- 
tion of income from earning assets. In other words, rate of return 
and wage rate dominate the weighted value of stock of earning 
assets and prices in affecting savings. 

It also follows from (47), that if 

5 < 0 , 

then 

(49) Vsu, + i1bt-^V s m<-Z 

Continuing the assumptions concerning the signs of the elasticities 
of savings, the first two terms of (49) are positive, the third is 
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positive but prefixed by a minus sign and the last term is negative 
and prefixed by a minus sign. Under these conditions it follows 
that dissavings respond more strongly to price changes than to 
changes in wage rate, rate of return and stock of earning assets 
with the appropriate weight. Consequently when dissaving takes 
place, price influences dominate all other influences. Surely 
this has a strong intuitive appeal since one would expect dis- 
saving to take place at an income level so low that prices of 
consumers’ goods dominated all consumers’ actions. 


IV 


New and Modified Slutsky Conditions 


The Slutsky conditions, which constitute restrictions on the 
demand equations, are modified from their usual form when hours 
of work and savings are included in the utility function. This is 
easily seen by reference to equation (15), keeping in mind the 
symmetry of D. From (15) we may write 


(50) 


3 ^ 
dp T 


Tc 


dx i 

dr 



+ 3* 


da* 

dr 


And from (22) we may write 
(51) 


dx t dx { dx h dx k 

r df k + Xk dM = r df i + Xi m' 


In addition to the modified Slutsky conditions of (50) and 
(51), we can find another restriction on the demand equations 
similar to the Slutsky conditions. From comparing equations 
(16) and equation (29), we see that it is possible to write 


(52) 






Equation (52) is a Slutsky-type restriction and presumably 
under ideal conditions would be capable of being tested for 
refutation. Alternatively, from equations (23) and (32) it is 
easily seen that we can say 

dx t dx { dy dy 

r dw V dM r dp~ Xi dM' 


(53) 
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Some Restrictions on Elasticities 


In addition to the Slutsky-type of restriction, we can bring 
forward other restrictions concerning the elasticity of the 
dependent variables with respect to rate of return and the stock 
of earning assets. Put briefly, this class of restrictions says that 
the elasticity of any dependent variable, i.e. the x’s y y or 5 , with 
respect to the rate of return equals the elasticity of the same 
variable with respect to the stock of earning assets. 

Thus from equations (12) and (19) we notice that 

1 dXi 1 dx i m 
~ dM ~ M ~dr ’ 


consequently 

(54) 


M d%i r dx { 

~x’ i m = 'x i 'dr =r,iM = Vi ' 


(i = 1, . . ., n). 


From a comparison of (13) and (20) it is clear that we can also 
state that 


(55) rj„ = rj vM . 

It is equally obvious from (14) and (21) that 

( 56 ) Vsr = VsM- 

The restrictions on the demand equations that are represented 
by equations (54), (55) and (56) are restrictions on individual 
demand equations at the same level as the Slutsky conditions. 
I.e. they arise from the fundamental assumptions regarding the 
utility equation and the postulate of maximization. In an em- 
pirical sense they are both less complex and, probably, less 
important than the Slutsky conditions; empirically they appear 
to be less restrictive on the variables than the Slutsky conditions. 
In substance they merely state that the effect of a change in 
income from earning assets on the quantities of goods, hours of 
work or savings is the same whether the increment results from 
a change in rate of return or a change in stock of earning assets. 


The Mjcro-Propensity to Save 

By employing hours of work and savings as variables in the 
utility function, we gain a more realistic prototype of the 
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consumer and additional restrictions on the individual demand 
equations. In addition the marginal propensity to save takes 
on meaning in the case of an individual. That is, we can now 
write 

ds d s ds __ ds , 

(57) ds^—dw + —dr + —dM + ^ — dp i 

v ' dw dr dM y op t 

as a meaningful statement in which substitutions can be made 
and which is related to other aspects of the consumer’s behavior. 
An individual propensity to save relation is not possible in the 
orthodox formulation of utility. 

Indeed the micro-approach yields a more comprehensive 
statement of the motives behind the marginal propensity to 
save because it includes the effects of relative prices in the last 
term of the right member of (57). That is, the macro-approach 
usually shows the propensity to save as a function of income, 
but not of relative prices. The response of savings to changes in 
income, but not prices, at the micro level is given by 

. ds ds ds 

(58) ds = — • dw + — dr + — - dM. 

dw dr dM 

But equation (58) is strictly valid only if there have been no 
changes in the prices of the goods consumed, or if the last 
term of (57), through a coincidence, is equal to zero. The absence 
of general validity for equation (58), coupled with the fact that 
the micro propensity to save is a combination of many micro 
propensities to save, suggests that a part of the difficulties in 
empirical studies of the macro propensity to save or to consume 
may be due to their ignoring relative prices as independent 
variables. 
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On Economic Dynamics 




MILTON FRIEDMAN 


In Defence of Destabilizing Speculation 1 ) 


Two propositions about private speculation are widely held: 
first, that .speculation is in fact often destabilizing, in the sense 
that it makes fluctuations in prices wider than they would 
“otherwise” be; second, that destabilizing speculation neces- 
sarily involves economic loss. This pair of propositions underlies 
much current opinion about commodity policy — where they 
lead to support for “buffer stocks” and similar plans — and 
about balance of payments policy — where they constitute a 
chief criticism of floating exchange rates. 

This note is not intended to be an exhaustive analysis of this 
pair of propositions, or of speculation in general. Its purpose is 
much more limited: to point out that the second proposition is 
invalid, that destabilizing speculation, though it may in some 
cases lead to economic loss, may in others confer economic 
benefit. The empirical generalization about the prevalence of 
destabilizing speculation — which is what gives the theoretical 
proposition its interest — seems to be one of those propositions 
that has gained currency the way a rumor does — each man 
believes it because the next man does — and despite the absence 
of any substantial body of well documented evidence for it. 
It is a proposition that badly needs intensive empirical in- 
vestigation. My own conjecture is that such an investigation 
would show it to be unfounded. But this is simply a conjecture 
and plays no part in what follows. 

The ready acceptance of the proposition that destabilizing 
speculation is economically harmful reflects, I believe, a natural 
bias of the academic student against gambling and in favor of 
insurance. It is natural for him to regard a futures market, for 
example, as a market in which a “legitimate” producer hedges 

x ) I am indebted for comments on an earlier draft to Martin Bailey, Harry 
Johnson, James Meade, Joan Robinson, and Dennis Robertson. 
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his risks by transferring them to a “speculator”; the producer 
is viewed as buying “insurance” from the speculator. But 
granted that this is a possible and indeed likely interpretation 
of an actual futures market, it is not the only possible one. May 
such a market not be one in which the “legitimate” producer 
engages as a side-line in selling “gambles” to speculators willing 
to pay a price for gambling and knowingly doing so? And if so, 
moral scruples about gambling aside, is any economic loss in- 
volved? 

In arguing that destabilizing speculation need not involve 
economic loss I do not mean in any way to deny the usual view 
that stabilizing speculation confers benefit. In this usual view, 
the economic function of speculation is taken to be the reduction 
of inter-temporal differences in price. In a commodity market, 
for example, a speculator is viewed as performing this function 
by buying when the crop is plentiful and prices “abnormally” 
low, holding stocks of the commodity until prices have risen, 
and then selling when the crop is short and prices “abnormally” 
high. In this way, speculators transfer resources from less to 
more urgent uses. The difference between the prices at which they 
sell and buy is their margin, which must cover costs of storage 
and furnish their remuneration. The excess over storage costs 
is a payment for specialized skill in knowing when to buy and 
when to sell and perhaps also for bearing risk. 

This model takes for granted that there is a meaningful 
distinction between speculative and other transactions, that one 
can speak of what the price would have been in the absence of 
speculation. This is a point that raises many difficulties and 
requires careful examination in any full analysis of speculation. 2 ) 
We can, however, evade it for our purposes by narrowing the 
question under discussion. Consider any market in operation. 
Suppose that an additional set of transactions are made in that 
market by an additional group of people whom we shall call 
“speculators” or “new speculators”. We shall then deal only 
with the question whether this additional set of transactions 

s ) See, for example, the comments by W. J. Baumol, "Speculation, Profi- 
tability, and Stability," Review of Economics and Statistics, August, 1957, 
pp. 263—71. 
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increases the fluctuations in price and, if it increases them, 
whether it involves an economic loss or confers a gain. By dealing 
in this way with a change in the amount of speculation, we can 
avoid the troublesome intellectual problem of defining zero 
speculation without any essential loss in generality. We shall 
make one further assumption to evade a troublesome problem: 
namely, that the activities of speculators do not affect the 
quantities demanded and supplied by other participants in the 
market at each current price. This implies that there is a well- 
defined price that will clear the market at each point in the 
absence of speculation and that this price is not affected by 
speculation. 

With these assumptions, it is clear that, if carrying costs are 
neglected, our model implies that speculators gain if they reduce 
inter-temporal differences in price, and lose if they widen such 
differences. Speculators can fill in the troughs of price move- 
ments only by buying net when prices would otherwise be low; 
they can flatten out the peaks only by selling net when prices 
would otherwise be high; unless they carry this so far as to 
reverse peaks and troughs, they gain by the difference. Con- 
versely, speculators can make fluctuations wider (in the same 
direction) only by selling net when prices would otherwise be 
low and buying net when prices would otherwise be high. But 
this means that they sell at a lower price than they buy and so 
make losses. Our model therefore implicitly defines stabilizing 
speculation as speculation yielding gains (carrying costs aside) 
and destabilizing speculation as speculation yielding losses. 
The circumstances, if any, under which this will not be true 
deserve extensive examination in a full analysis of speculation 
but can be accepted for our more limited purposes, which is 
simply to show that destabilizing speculation need not involve 
economic loss, not that it cannot do so. 3 ) 

3 ) See Ibid, for one such fuller examination. It will be clear that our as- 
sumptions rule out the main case there considered. 

Baumol also considers a special case corresponding to our assumptions 
(pp. 26$ — 70). His own conclusion is ambiguous but only because in judging 
the profitability of the speculation he does not require it to be carried through 
to completion, in the sense that the speculators end up in their initial position 
with respect to the holdings of the speculative commodity. 
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One reason why actual speculation might not conform to the 
model described in the preceding three paragraphs is avoidable 
ignorance. By no means all actions that are mistakes when 
viewed ex post fall into this category. If I wager even money 
that a coin will come up tails and it comes up heads, I clearly 
have made a mistake ex post, in the sense that I shall wish that 
I had chosen heads. If, in addition, I discover by an examination 
of the coin that it has heads on both sides or in some other way 
is biassed toward heads, and if I could have made this examina- 
tion before the wager, then I have also made a mistake ex ante. 
On the other hand, if such additional examination gives me no 
more reason than I had before to question my belief that the 
coin is fair, then my initial choice of tails may be bad luck but 
cannot be described as a mistake. The distinction between the 
two cases is, in principle, whether I would have acted differently 
in advance of the actual toss if I had had the knowledge I 
gained after the toss except for the actual outcome itself, i.e., 
if I had had the knowledge that it would have been possible for 
me to have had before the toss. In the same way, the mere fact 
that speculators make losses over a particular period and in fact 
destabilize prices for that period is no evidence either that the 
losses could have been avoided given the general state of knowl- 
edge when the speculation was entered into or that speculation 
is on balance destabilizing in any more fundamental sense. 

If destabilizing speculation does arise from avoidable ignorance, 
it must be granted immediately that there is an economic loss. 
The loss is borne primarily by the speculators, though if the 
operation is sufficiently large, second order effects on others 
may not be negligible in the aggregate. It may be noted in passing 
that insofar as this case justifies any action by government, it 
justifies solely the distribution of knowledge. Suppose private 
speculation is destabilizing because ignorant speculators behave 
against their own interests, but speculation by government 
officials trying to achieve the same end as private speculators 
would be stabilizing because of greater knowledge. The appro- 
priate solution is then for the government officials to make their 
knowledge available either by providing the information on 
which their price forecasts rest or by making and publishing 
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the price forecasts themselves. If these are more accurate, on 
the whole, than the forecasts private speculators would otherwise 
use, private speculators have a strong incentive to act in ac- 
cordance with them and in the process will produce the same 
results as government speculation in accordance with the same 
forecasts. If the forecasts are not more accurate, they will tend 
to be disregarded and no great harm will be done. 4 ) 

To see how destabilizing speculation can arise without avoid- 
able ignorance, let us start with a commodity market which 
is in operation. Suppose that there exists independent gambling 
establishments in which all gambling takes the form of betting 
on the future price of the commodity in question — say rubber. 
The people who bet on the price of rubber in the hypothetical 
gambling establishment do not buy or sell rubber, and neither 
do the people who run the establishment. Their operations 
therefore have no direct effect on the price of rubber; the rubber 
market simply takes the place of the roulette wheel at Monte 
Carlo. 5 ) We may suppose the proprietors of an establishment to 
operate solely as brokers, engaging in no gambling themselves 
but being paid a fee for providing facilities and bringing together 
people willing to take opposite sides of a common wager. And we 
suppose throughout that the people engaging in the gambling 
do so deliberately and are reasonably well informed: they like 
to gamble and are willing to pay a price to do so. Let us put to 
one side any moral objections to gambling, and suppose that 
the gambling services are provided under competitive con- 
ditions. The proprietors of the gambling house are then devoting 
economic resources to producing services to satisfy the wants 
of consumers, who are willingly buying the services and paying 
a price equal to the cost of the alternative services that could have 
been obtained with the same resources. Clearly there is economic 

4 ) One case in which publication of forecasts or the equivalent might be 
especially called for is if the authorities feel it necessary to suppress some 
relevant information for security reasons. They might be able to offset the 
effects of such suppression on the judgments of traders by issuing price forecasts. 

5 ) Th$re could be an indirect effect if, for example, information about the 
odds ruling in the gambling transactions altered the expectations about future 
prices of the people trading on the spot market and so changed amounts diverted 
to stocks. 
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gain rather than loss through the operation of the gambling 
house. 6 ) 

Of course, there may in fact be no demand for this service at a 
price sufficient to call it forth. Whether there is depends on the 
preferences of the public for gambling of various types, the kind 
of gambling provided by the rubber market — that is, the 
probability distribution of the price of rubber — the alternative 
sources of gambling services, their cost and character, and so on. 
The willingness of people to buy lottery tickets at less than their 
actuarial value even though they know full well the probabilities 
of prizes of various size is sufficient evidence that people are 
willing to pay a price to bear at least certain kinds of risks, to 
be subjected to increased uncertainty. In any event, our concern 
is not with the likelihood that gambling establishments of the 
kind described would be profitable but only with the consequences 
if they were. 

Consider an individual who wants to bet that the price of 
rubber will be higher a month from now than it is now. He can 
place such a bet in the gambling establishment at some odds and 
subject to paying a commission to the proprietors. An alter- 
native way in which he can subject himself to the same un- 
certainty is to buy rubber in the market, store it for a month, 
and than sell it: he can accumulate positive stocks. The cost 
in this case is the cost of storage over the month. Similarly an 
individual who wants to bet that the price will fall can ac- 
complish the same objective by selling rubber now, borrowing 
the physical commodity in order to make delivery currently: 
he can accumulate negative stocks. He may be paid for doing 
so, because he saves someone storage costs. Presumably however, 
the amount he is paid will be less than the storage costs, the 
difference being the fee for lending the commodity. And if the 
loan requires dipping into stocks needed, say, to facilitate 
production, storage costs may be, as it were, negative and he 

6 ) It will be noted that a pure futures market is very close to such a gambling 
establishment. A transaction on a futures market does not by itself have any 
effect on the spot market. It affects current price only to the extentrthat the 
price established leads to operations on the spot market and thereby to a change 
in the size of stocks carried over. This is analogous to the indirect effect described 
in the preceding footnote. 
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may have to pay to borrow the goods. (Remember that we are 
considering the effect of the actions of an additional group of 
people. Their holding negative stocks simply means that total 
stocks are less than they would otherwise be). Suppose in- 
dividuals find this alternative way of gambling cheaper. The 
gambling establishments will then disappear and the gambling 
services be provided by the rubber market. 

If purchases and sales just offset, there is no effect on current 
price and the net costs are the various commissions paid to 
transact the business. The market dealers have taken the business 
of providing gambling services away from the gambling in- 
stitutions proper. But purchases and sales need not just offset 
one another — indeed, the lack of necessity for them to do so 
may be one of the advantages of operating through the market, 
though a similar possibility could be provided by the gambling 
establishments if their proprietors “made book” rather than 
simply acted as brokers. If purchases and sales do not offset one 
another, the price of the commodity is affected. 

We have now combined the two activities: gambling on the 
price of rubber, and the rubber market proper. Given com- 
petitive conditions, this combination will occur only if it is a 
cheaper way to provide gambling services and so in this respect 
represents increased efficiency in the use of resources. If the total 
expenditures of the gamblers on gambling services exceeds the 
commissions involved, this is equivalent to saying that, viewed 
as a body of speculators, they engage in destabilizing speculation. 
But their losses are someone's gain. In the first instance, they 
will be the gain of the initial participants in the rubber market. 
Operating on the rubber market has now become a more at- 
tractive business since one can now engage in joint production, 
producing gambling services as well as trading services. The 
result will be to attract more people into the activity. Temporary 
gains will be competed away and the trading margin proper 
reduced, so raising the average net price of rubber to the producer. 
But this in turn will stimulate output and so reduce the average 
net price of rubber to the consumer. The provision of gambling 
services is now being rendered jointly by the producers of rubber, 
the middle men, and the consumers of rubber. In return for 
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wider fluctuations in price — which are required to provide the 
gamblers or speculators with the uncertainty they want to bear 
— the producer gets a higher average price and the consumer 
pays a lower average price. 

Any individual producer or consumer who disliked the wider 
fluctuation of prices could insure himself against it. But, given 
our assumptions, it cannot be that producers and consumers 
would be willing to pay more on the average than the difference 
between old and new average prices to insure themselves against 
the wider fluctuations. For this would contradict the initial 
assumption that there was a demand for the services of the 
gambling establishments at a positive price. The people who 
were willing to make bets on the price of rubber — willing to 
assume risks — would then have found that they were paid, 
instead of having to pay, for doing so. Instead of the market 
being supplemented by gambling institutions, it would have 
been supplemented by insurance companies, insuring people 
against the fluctuations in prices. 

I grant readily that this picture of a world in which increasing 
fluctuations in the prices of commodities is a service that com- 
mands a positive price is hard to accept as a valid description 
of the actual world; not so much because people are not willing 
to pay for gambling — they clearly are — but because there 
seem to be so many cheaper ways of producing the gambles 
that people want to buy, though it must be noted that some 
of these are illegal in many countries. However, this is the picture 
that is implicit in the acceptance of the empirical generalization 
that destabilizing speculation often occurs in practice, except 
for such destabilizing speculation as is attributable to avoidable 
ignorance, or as may be consistent with deviations from our 
initial assumptions. 

Whether particular services command a positive or negative 
price — are consumption services or productive services — is not 
determined by physical or technical considerations alone; it 
depends also on the tastes and preferences and the capacities 
and opportunities, of the community at large. Paintings fence 
is generally regarded as a productive service that must be paid 
for, as an activity yielding disutility, and so the price of painting 
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a fence is generally negative; Tom Sawyer was able to reverse 
this attitude and to make it an activity yielding utility; he was 
able to charge a positive price for the privilege of painting a 
fence. This is the essential issue involved in judging speculation. 
Is bearing uncertainty a service that must be paid for? Or a 
privilege for which people are willing to pay? Is speculation the 
rendering of a productive service that commands a reward? 
Or is it a means of gaining utility on which people spend part of 
their income? If it turns out to be the second rather than the 
first, is this any reason for regarding it as involving economic 
loss? Does not the tendency to do so simply reflect the preconcep- 
tions of the academic? 



HAROLD T. DAVIS 


The Influence of Interest Rates 
on Time Series of Price 


1. Introduction . 

In a paper that has become classic 1 ) Harold Hotelling discusses 
the role of differential equations in the application of mathematics 
to the problems of the economist. He points out both the analytical 
advantages in their use and the statistical dangers which appear 
when they are applied. He admires the superb formulation of 
the astronomer with his exact equations and his accurate deter- 
mination of empirical constants. This he contrasts with the less 
satisfactory situation of the economist, who has few rigorous 
laws to guide him and who must rely upon correlations for the 
computation of his parameters. But Hotelling points out that 
the astronomer has had the great advantage of a "tyrannical 
sun" which dominates the motion of the planets, while the 
economist must study a system in which the influence of any 
single factor is submerged in a complex of factors, none of which 
is dominant. 

A problem in point is that of the role of the rate of interest in 
the dynamics of economic time series. This factor has been 
studied since the time of Adam Smith and David Ricardo. 
Volumes have been written about its influence. Irving Fisher 
devoted a considerable part of his scientific career to its study. 
The names of Walras, Pareto, Keynes, Snyder, Cassel, and many 
others appear in the bibliography of the subject. But in spite 
of this vast study, it is a curious matter to observe that one 
still finds it very difficult to measure with any degree of exact- 
ness the influence upon the business cycle of a change, such, 
for example, as that of the rediscount rate made by the board 
of the Federal Reserve System. 

x ) “Differential Equations Subject to Error, and Population Estimates/* 
Journ. Amer . Statistical Assn., Vol. 22 (1927), pp. 283 — 414. 
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One reason for this difficulty is to be found in the observation 
that this factor seems to exert its influence implicitly through 
a utility function, rather than as an explicit variable. One knows 
from abundant experience that an individual under the duress 
of severe financial need will agree to the payment of usurous 
rates of interest, but if he is in the possession of blue-chip 
collateral, then he is quite willing to argue the rate with his 
banker. What is true for the individual is also true for business. 
This is readily observed from the fluctuations in the interest 
rates on 4 to 6 months commercial paper, which has varied in 
this century from 7.37 % in 1920 to less than one per cent in 
recent years. Thus the rate of interest is observed to be a function 
of the individuars or the businessman's money-utility. 

For this reason it has seemed to the author that some light 
might be thrown upon the problem if the effect of the rate of 
interest upon economic time series was approached strictly from 
the point of view of the utility function. Fortunately for this 
investigation a very promising beginning was made by G. 
Tintner in 1938, 2 ) who introduced the interest rate into the 
familiar problem of the maximization of utility in a price- 
commodity space, subject to the restraint of the budget equation. 
A problem proposed by him, but not solved, was the investigation 
of second order conditions to establish criteria for the existence 
of a maximum for the utility function thus extended to include 
the rate of interest. The model suggested for such an investigation 
was that of Hotelling published in 1935. 3 ) 

In the present paper the author proposes to explore the further 
consequences of Tintner's analysis and to integrate it with a 
dynamical theory of prices introduced by him in 1941. 4 ) 

2. A Dynamical Theory of Prices Which Includes the Effect of 

Interest 

We shall give a brief summary of the principal result obtained 

2 ) “The Maximization of Utility over Time," Econometrica, Vol. 6 (1938) 
154—158. 

s ) “Demand Functions with Limited Budgets,” Econometrica, Vol. 3 (1935) 
pp. 71 ff. 

4 ) Analysis of Economic Time Series, (Bloomington, Indiana, 1941), pp. 
370—385. 
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by Tintner. Limiting himself to three quantities: x, y, and z, 
he denoted the amount of each consumed at time thyx(t), y(t), 
and z(t) and the corresponding prices by p{t), q(t), and r(t) 
respectively. He then introduced a utility function, which we 
shall denote by the symbol U, and a force of interest p(t). The 
utility was then maximized subject to the budget restraint 

(2.1) E = x{t)p{t) + y{t)q{t) + z(t)r(t). 

The equilibrium equations then assume the form 

u' v u' r r* i 

where 1 is the marginal utility of money at time t = 0. The 
novel element introduced into the familiar problem is the function 
involving the force of interest. 

We shall now introduce this result into a dynamical theory of 
prices which the author proposed some years ago. The arguments 
in support of the theory have been given elsewhere and will not 
be repeated here. Principal support for the theory is found in the 
empirical evidence of its effectiveness in describing observed 
phenomena associated with the movement of prices, in particular, 
the behavior of stock prices, including the bull market of 1929, 
the subsequent depression, and the activity of the series in the 
period of recovery. But the introduction of the factor of interest 
into the theory as proposed here is not for the purpose of giving 
further support to the theory itself; it is rather to see what the 
effect would be when such an introduction is made. The con- 
sequences seem to have some interest, at least from the mathe- 
matical point of view, whatever one may think of their actual 
economic significance. And as such they are offered to the reader. 

The dynamical theory of prices is based upon the following 
integral over some time interval: 0 ^ t ^ T: 

r T 

(2.3) /= j (U-kA-mB)dt, 

where U is the utility function, A is a measure of the erratic 
element in the economic system, and B is a measure of the factor 
of supply and demand. The quantities k and m are constant 
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parameters of positive sign. A realistic definition of A was found 
to be the sum of the products of the time-rates of change of goods 
consumed by the time-rates of change of their prices. Thus, in 
the notation used above for the three-commodity system of 
Tintner, we write: 

(2.4) A = x'(t)p'(t) + y'{t)q'(t) + z'(t)r'{t ). 

Although this bilinear form, considered only mathematically, 
may have either sign, the economic argument shows that it is 
intrinsically positive. Hence its introduction into the integral 
(2.3) reduces the value of J. 

The quantity B is defined to be half the sum of the squares 
of the differences between the quantities of good consumed and 
the quantities produced. If we denote by u(t), v(t), and w(t) 
the quantities of x y y, and z produced, we write 

(2.5) B = H(x — u) 2 + (y — v) 2 + (z — w) 2 ] y 

the factor \ being introduced only for convenience. 

Although there is a tendency toward equilibrium between 
x, y y and z and u, v, and w, their differences may have either sign. 
If we define u, v y and w as current inventories at time t y and x, 
y y and z as the amounts currently demanded, then, in times of 
depressed business, (x — u), {y — v) y and (z — w) are often 
negative; but when business is active, as in recent years, the con- 
trary is true. The value of B y however, is never negative and 
when this factor is introduced into (2.3) the value of J in general 
is reduced. 

The primary assumption is then made that prices are deter- 
mined in such a manner that the integral J is maximized over 
time. This leads us to the following three equations: 

U’ x - kA’ x — mB‘ a — — k 4 A',., 

at 

U' v -kA' v -mB' v = -k~A^ 

at 

U'„ - kA' t — mB' e = — k^-A'j. 


( 2 . 6 ) 
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If we now introduce the values for U x , U v , and U z from (2.2) 
and the values for the derivatives of A and B from (2.4) and 
(2.5), we obtain the following equations: for the determination 
of the prices p(t), q(t), and r(t): 

k-rr + ^ ex P I" - f p{ s )ds P = m { x ~ u )> 
at 2 L Jo J 


(2.7) 


+ A exp [~J p(s)ds^q = m(y — v), 

k — + X exp f p(s)ds\r — m(z — w). 
dt 2 L Jo J 


Without essential loss of generality we can assume that 
k — 1. If we replace the right-hand members of (2.7) by the 
symbols E(t), F[t), and G(t), and if we use the abbreviation: 
R(t) = exp [— Jo p{s)ds], then the equations can be written: 

d Lt + XR(t)p{t)=E{t), 


( 2 . 8 ) -^ + mt)q(t) = F(t), 

^ + XR(t)r(t) = G(t). 

It is the purpose of this paper to explore the consequences 
of this system of equations and to observe the difference in the 
behavior of prices between the two situations: (1) when R(t) 
is present in the system; and (2) when R(t) is replaced by a 
constant. 

It might at first sight appear that the problem as it is for- 
mulated in equations (2.8) is oversimplified, since there is no 
explicit intercorrelation of the three prices as is usually the 
case in real markets and particularly for commodities which 
are competing. Generality can be restored to the system, however, 
by adjoining to (2.8) a functional connecting x, y, and z and 
their marginal rates of substitution. The obvious complexities 
which are introduced will be avoided in this paper by 
assuming independence between the variables. The general 
problem is thus reduced essentially to the case of a single market. 
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3. Behavior of Prices Under a Constant Positive Force of Interest 
— Integration of the Differential Equation. 

We shall begin with a simple market involving one commodity 
and its price as defined by the first equation in system (2.8), 
that is to say, with the equation: 

(3.i) ~ + mt)m = E(t). 

Let us assume that the force of interest is a positive constant 
and equal to p, from which we get R(t) — e~ pt . Equation (3.1) 
then assumes the form 


(3l2) ^ + =E(t). 

The first problem suggested by this equation is to find the 
behavior of the price when E(t) is zero, that is to say, when 
the supply factor equals the demand factor. We are thus led 
to the solution of the equation 

d 2 P 

( 3 - 3 ) — -rAe-'‘J?(0 = O, 


where P{t) is used to denote the equilibrium price. 

This equation can be integrated in terms of Bessel functions. 
To accomplish this we assume a new independent variable 


(3.4) 


z = 


Wx 




P 


in terms of which we compute 


dP 

dp 

*1 - X p 2 z 


, d I\ 

~dt ~ ' 

dt* ~ iPA 

\ dz 2 

V dz) 


When these derivatives are substituted in (3.3) and obvious 
simplifications made, the following equation is obtained: 


(3.5) 


d 2 P 


dz* 


dP 

dz 


+ zP = 0 . 


which is recognized as the simplest case of Bessel’s differential 
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equation. Its general solution is the function 

(3.6) P = CJ 0 (z) + DY 0 (z), 

where J 0 (z) and Y 0 (z) are respectively the Bessel functions of 
first and second kind of order zero. 

The general solution of equation (3.3) thus assumes the form: 

(3.7) P(t) = CJ 0 (~^ e ~ ipt ) + DY o ■ 

In order to interpret this equation, we first observe that 
as t oo, we have 

lim P(t) = C/ 0 (0) + DY 0 ( 0). 

t~+co 

But since J 0 (0) = 1 and Y a (0) = — oo we must set D — 0 in 
order to obtain a realistic description of the equilibrium price. 
We are thus led to a consideration of the function 

(3.8) P(t) = CJ 0 (Ke~ ipt ), 

where we use the abbreviation: K — 2y/./'p. 

As is well known the function J 0 (z) has the following asymp- 
totic approximation provided z is sufficiently large: 

(3.9) J 0 (z) ~ ]/ — cos (z - in). 

f nz 

Substituting Ke~* pt for z in this formula, we obtain the 
following asymptotic value for P(t): 

(3. 10) P(t)^c]/~ e*' 4 cos {K e’^ 4 - \n), 

» uK. 

which holds only when the argument is large. 

But if we assume that t = 0 is the origin of our time-scale, 
then the largest value that z can have is K. Since K oo as 
p — > 0, it is clear that large values of the argument are to be 
■obtained only when p is small. 

We can now formulate a few of the characteristics of P(t) 
as follows: 

(a) If K is sufficiently large, that is to say, if p is small, then 
.P(t) is an oscillating function in the neighborhood of the origin. 
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(b) As t increases the number of zeros within any fixed interval 
decreases as t increases and this number ultimately reduces 
to zero. 

(c) The relative maxima of P(t) increase as t increases. 

(d) The distance between relative maxima increases as t 
increases. 

(e) P(t) finally approaches C asymptotically from below. 

4. Behavior of Prices When the Force of Interest is Very Small . 

A question of considerable analytical interest is invoked when p 
is zero or very small. If p is actually zero, then equation (3.3) 
reduces to the following: 

d 2 P 

(^•1) + m) = o. 

which has the solution: 

(4.2) P(t) = a sin (Vlf + b). 

But is it very interesting to observe that P[t) as defined by 
(3.8) does not reduce to P(t) as given by (4.2) when p 0, 
since K -> co, and the function approaches zero for all values 
of t. 

It is important, however, to have a solution of equation (3.3) 
when p is small, that is to say, a solution intermediate between 

(4.2) and (3.8). Such a solution can be developed formally as a 
power series in p, that is, in the form: 

(4.3) P,(f) = a sin (VI* + b) + p^t) + p*<f> 2 {t) + . . .. 

Since secular terms, that is to say, terms of the form t n sin VI £ 
and t n cos s/7.t, are thus introduced, one has reservations about 
the convergence of the series. But it is not difficult to show that 
for a limited range of t and for p sufficiently small the difference 
between P(t) as defined by (3.8) and the first two terms of P 0 (t) 
can be made arbitrarily small. 

Aftei; some effort the value of $ x (f) was finally obtained 
and P 2 (t) written in the following somewhat more convenient 
form: 
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(4.4) 


P 0 (t) = a 0 [cos at -f — (~sin at+at cosat+at 2 sin at) + . . .J 
+ a x [sin at + ■— (t sin at — at 2 cos a?) + . . ■ J > 


where a 0 and a x are arbitrary constants and a 2 = A. 

In order to compare P(t) as defined by (3.8) with P 0 (2), we 
equate both functions and. their first derivatives at t = 0. We 
thus find 


(4.5) a 0 - CJ q (K), a x - ). 

where J x {z) is the Bessel function of first order. 

The magnitude of the error \P(t) — P 0 (£)| i s shown by the 
following numerical approximations: 

Setting C = 100, A = 1, p = 0.1, t — 2, we compute: 

P(t) = 100 / 0 (18. 09675) = 0.48167, P 0 (t) = 0.49995, 

which is quite satisfactory, considering the comparatively large 
value of p, namely, a force of interest equal to 10 %. 


5. Behavior of Prices When the Force of Interest is Negative. 

There is nothing inherent in the theory of Tintner to prevent 
us from assuming that the force of interest in R(t) is negative. 
That is to say, we can write: R(t) = e^, p > 0, and thus replace 
equation (3.3) by the following: 

(5.1) 0, 

where n(t) denotes the price at time t. 

The meaning to be attached to this assumption is readily 
seen from Tintner's fundamental equation, namely 

(5.2) I 5 + Vu(l + V-i) = Ej + Sf, ? = 1, 2, . . ., n f 

s n = 0, where I is the expected money income, E i the expected 
expenditure, and s 3 the expected savings at time j, and s^ x the 
savings planned for time j — 1. These savings are accumulated 
at an anticipated rate of interest i^ v But as one well knows 
savings are not always invested at a profit, in which case i^ x is 
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negative. The theory leads to the interesting conclusion that the 
anticipated marginal utility of money at time j — 1 may be 
written 

(5.3) X = ^/[(l + h.)(! + H) . . . (1 + i|-i)L 

where X 1 is the observed marginal utility at the origin of the time 
sequence. 

Assuming that the rates of interest are positive over a given 
period of time, then (5.3) can be replaced by the marginal 
utility: X = A 1 e~* >i ; but in the contrary case, if the rates of 
interest are negative, then the utility becomes: X — X^ 1 . 

Before considering the actual economic consequences of this 
assumption, let us first consider the solution of equation (5.1). 
Observing that Jq{—z) = J 0 {z), we obtain the desired solution 
by replacing p by — p in equation (3.8). We thus have 

(5.4) n(t) = CJ 0 (Ke*<*), 

where K = 2VJ.jp and C is an arbitrary constant. 

From the properties of the function J 0 (z ) we readily obtain 
the following characteristics of n(t): 

(a) n{t) declines in an oscillatory manner from an initial value 
7c(t 0 ) and the frequency of its oscillations increases as t in- 
creases. 

(b) The maxima of n{t) are asymptotic to the function: 
CV 2jnK 

(c) 7i(t) approaches zero as t oo. 

6. Graphical Representation of the Functions. 

Before considering the economic realities of the theory which 
we have described, let us examine the graphical representation 
of the functions for a few special values of the parameters. 

Case 1. If p> 0, then we have: P(t) = CJ 0 (Ke~"i pt ), where 
K = 2VX/p. Let us introduce arbitrarily the values: C = 10, 
X = 0.09, p = 0.04, whence K = 15. This function we shall 
consider over the range: 0 t ^ 150, that is to say, we shall 
evaluate J 0 (z) between 2 = 15 and z == 0.75, where z = 15 e” 0 - 02 *. 
Values of P(t) over this range are contained in the following 
table: 
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t 

z 

p{t) 

t 

z 

p(t) 

0 

15.0 

-0.14 

60 

4.5 

-3.21 

10 

12.3 

-1.11 

70 

3.8 

-4.01 

20 

10.0 

-2.46 

80 

3.0 

-2.60 

30 

8.3 

1.09 

100 

2.0 

2.07 

40 

6.8 

2.89 

125 

1.23 

6.53 

50 

5.5 

0.10 

150 

0.75 

8.64 


The maxima, minima, and zero values of P(t) over the given 
range are contained in the following table: 


t 

z 

PW 

t 

z 

p(t) 

0+ 

14.93 

0 

38.0 

7.02 

3.00 

6.1 

13.32 

2.18 

50.0 

5.52 

0 

12.0 

11.79 

0 

68.5 

3.83 

-4.03 

19.3 

10.17 

-2.50 

91.5 

2.40 

0 

27.5 

8.65 

0 


1 



From these values the graph of P(t) is now constructed between. 
t = 0 and t = 150 as shown in Figure 1. In this same figure 



-8 
- 7 
-8 
-9 
- 10 


Fig. 1. 
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there are shown also the graphs corresponding to p = 0.03 and 
p = 0.06, which were similarly computed. 

We observe from these figures that P(t) for positive values 
of p oscillates about the base line, but ultimately approaches 
asymptotically the line P = 10. We also see that this value is 
approached more rapidly the larger p is taken and that at the 
same time the number of oscillations diminishes. It is also worthy 
of note that as p approaches zero the curve tends towards a sine 
curve, but as the number of oscillations increases the amplitude 
of the maximum values diminishes. 

Case 2. If p < 0, then we are concerned with P{t) =CJ 0 (K e&*). 
Introducing the arbitrary values: C = 10, = 0.02, p = 0.04, - 

we consider this function over the range: 0 t ^ 150. In this 
case P(t) decreases in a sinusoidal pattern from an arbitrary 
initial value of 10 J 0 (l) = 7.65 to a limiting value of 0. This 
curve is shown in Figure 2, in which is also exhibited the graph 
corresponding to p = 0.06. 



Fig. 2. 

We now observe that the number of oscillations increases as 
p is increased and that the amplitudes of the maximum values 
diminish more rapidly. The curve which is asymptotic to these 
maxima, for p = 0.06, is also shown in the figure. Its equation 
is really found to be the following: 

y = cl/JL e-fc* = 9.772 e" 0 - 016 *. 
f 7lK 
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7. Economic Applicability of the Theory . 

It not infrequently happens that a pleasing mathematical 
formulation of an economic problem may actually have little 
application to the phenomena which is proposes to explain. 
Either the parameters may resist statistical estimation; or, if 
they can be determined, the real pattern and the mathematical 
pattern may be at variance with one another. It is thus a matter 
of some importance to put any set of equations to the test of 
reality. 

In the present case we can find at least one period where the 
present theory appears to have applicability, namely, the years 
between 1917 and 1935. Within these dates the movements of 
the business cycle were essentially those of a free economy, 
which had been stimulated by the war economy that had preceded 
it and which had not yet been affected by the intrusion of 
government spending, high income taxes, and other regulations 
that characterized the subsequent period. Large variations both 
in the index of prices and in the rate of interest are to be observed. 
The rise and fall of stock prices, a balanced national budget, 
and a decline in the national debt were also characteristic 
features of these years. 

But the reason why this period is particularly adapted to the 
present study is found in the observation that the principal 
criterion of Tintner's theory, the final disappearance of savings, 
is satisfied. To show this we shall assume that in 1917 a capital 
fund of $ 100,000 is invested at the current rate of interest, 

While the rate of interest may have numerous definitions, it 
will be convenient to assume this rate to be that of 4 to 6 months 
commercial paper. The income thus derived in the /-th period 
is I r From this income an expenditure is taken which we shall 
assume has a fixed purchasing value equivalent to $ 4,000 in 
1917. Thus E j = $ 4,000 P v where P is an appropriate price 
index with 1917 as base. The general price index of the Federal 
Reserve Bank of New York City seemed to be quite suitable 
for his purpose. 

In order to bring Tintner’s theory within the scope oi this 
paper it is convenient to introduce an average rate of interest, 
i A , in terms of which we can then write equation (5.2) in the 
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following equivalent form: 

( 7 . 1 ) If + (1 4 - Pj)Si-i = E i + s i> 

where we have: If = C i j + s j _ 1 i A , C — the initial capital, and 
pf = if — i A . As we have assumed above, C = $ 100,000. The 
average rate of interest, computed from the formula: (1 + i A ) n = 
(1 + %) (1 + i 2 ) . . . (1 + *„), was found to be 4.16 %. With 
these values the following schedule was then computed: 


Year 

Capital 

fund 

Price Index 
1913= 1.00 

Interest 

rate 

Pi 

h 

(1 + P*) 5 f~i 

■ 

E > 


1917 

$100,000 

1.39 

4.74% 

0.58 % 

$4740 

$ 0 

$4000 

$ 740 

1918 

100,740 

1.57 

5.87 

1.71 

5901 

753 

4520 

2133 

1919 1 

102,133 

1.73 

5.42 

1.26 

5509 

2160 

4960 

2709 

1920 

102,709 

1.93 

7 37 

3.21 

7483 

2796 

5560 

4719 

1921 

104,719 

1.63 

6.53 

2 37 

6726 

4831 

4680 

6877 

1922 

106,877 

1.58 

4.43 

0.27 

4716 

6896 

4560 

7052 

1923 

107,052 

1.65 

4.98 

0.82 

5273 

7110 

4760 

7623 

1924 

107,623 

1.66 

3.91 

-0.25 

4227 

7604 

4760 

7071 

1925 

107,071 

1.70 

4.03 

-0.13 

4324 

7062 

4880 

6506 

1926 

106,506 

1.71 

4.24 

0-08 

4511 

6511 

4920 

6102 

1927 

106,102 i 

1.71 

4.01 

-0.15 

4264 

6093 

4920 

5437 

1928 

105,437 

1.76 

4.84 

0.68 

5066 

5474 

5080 

5460 

1929 

105,460 

1.79 

5.78 

1.62 

6007 

5548 

5160 

6396 

1930 

106,396 

1.68 

3.56 

-0.60 

3826 

6358 

4840 

5344 

1931 

105,344 

1.50 

2.64 

-1.52 

2896 

5263 

4320 

3805 

1932 

103,805 

1.32 

2.84 

-1.32 

2998 

3755 

3800 

2953 

1933 

102,953 

1.29 

1.87 

1 -2.29 

1993 

2885 

3720 

1158 

1934 

101,158 

1.37 

1.14 

-3.02 

1188 

1123 

3960 

-1649 

1935 

98,351 

1.45 

0.91 

-3.25 

i 

_ 

_ 

— 

— 


It will be clear both from this table and from the graphs 
shown in Figure 3 that there is a general tendency in this period 
for interest rates and prices to move together. We shall now 
attempt to show that the dynamical theory which we have 
introduced is in satisfactory agreement with the observed 
behavior of prices. 

We .first observe that the force of interest varies from a 
maximum of 3.21 % to a minimum of —3.25 %. Between 1917 
and 1923 inclusive it has an average value of 1.45; between 1924 
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and 1929 its average is 0.029 and between 1930 and 1935 its 
average is —1.75. Corresponding to each of these periods we shall 
now assume the following values of p : p 3 — 0.0145, p z = 0.0029, 
and p 3 = —0.0175. 



If we now denote by P 1 the variation in price in the first period 
from an average value of 1.60, then the theory states that this 
variation can be represented approximately by the function: 

(7.2) P^CJ^e-w), 
and similarly for the third period by 

(7.3) Ps^/J-e-W'), 

Vs / 

where x and r' are variables measuring time from an initial 
value of zero. 

Since the marginal utility of money has presumably not 
changed throughout the time interval, the constant k is the 
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same in each function. To identify the initial value of P x with 
the observed value of the actual price index, we let C x = 1 and 
kjp x = 10. It then follows that kj\p z \ = 10(/> x /|p 3 |) = 8.29. 

Since both r and r' are variables of time, but in unspecified 
units, it is clear that if time (2) is measured in some standard 
unit such as months, we must write: r = at + b f r' = #'2 + 6', 
where a, b , and a/, 6' are constants. In fitting (7.2) and (7.3) 
to the observed data, where 2 in months varies from 0 to 204, 
we find the following transformations are necessary: r = 0.862f, 
and r' == 0.3542 + 0.413. Introducing these values respectively 
into (7.2) and (7.3), and determining C 2 = 2 from the observed 
data, we obtain the following functions as approximations to 
the series of price variations within the two periods: 

(7.4) P x = J 0 ( 10 e -0 * 00624 *), P 3 = 2 !/ 0 (12.53 e 0 - 0031 *). 

In the second period, since p 2 is very small, a sine curve is 
fitted to the data. A satisfactory representation is obtained from 
the following function: 

(7.5) P 2 = sin (0.00229 2 — 0.137). 

The excellence of the approximation thus achieved by (7.4) 
and (7.5) is readily seen from Figure 3, where the circles represent 
the values computed from these functions. 

8. Behavior of Prices Under a Constant Force of Interest and an 

Impressed Force . 

During the period which we have just analyzed in Section 7 
there is to be observed another price series which does not 
conform at all to the pattern of general prices. We refer to the 
price of industrial stocks as measured by the Dow Jones averages. 
The great difference between the two series is readily shown, 
for example, in the period between 1923 and 1929 when the 
Dow Jones average increased from 100 to a maximum of 340, 
while the general price index changed but little. A similar 
spectacular difference is also to be observed in the period between 
1950 and 1957, when stock prices, as measured by the Dow Jones 
averages, increased from a level of 200 to a maximum well over 
500. During this period general prices showed a modest increase 
of approximately 15%. 
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The orig inal intention of the author’s theory of price dynamics 
was to exp lain , first, an observed cyclical pattern in stock prices 
and, second, the abnormal rises which we have just described. 
Considerable success, measured in terms of the statistical 
agreement between theory and observation, has attended this 
explanation. Since both of the periods have been characterized 
by an abnormal business activity and since the rise in stock 
prices appears to be a direct concomitant of it, one is led naturally 
to seek a mathematical explanation in the non-homogeneous 
equation: 

(8.1) ^ + Xz- pt p{t) — E{t), 

where E{t) is a measure of industrial activity. 

In the author's original study, the effect of the interest rate 
was not included. In spite of this omission the dominating 
influence of the force function, E(t), made possible a satisfactory 
explanation of the explosive prices of the bull market through 
the phenomenon of resonance. But the long cycles observed in 
the movement of stock prices after the collapse of the bull 
market were not adequately accounted for by the restricted theory 
except by the introduction of certain ad hoc hypotheses. It 
seems highly probable that this behavior can be accounted for 
in a satisfactory manner by the introduction of the force of 
interest. 

The most conspicuous factor that was neglected in the earlier 
study was the influence of the rediscount rate of the Federal 
Reserve Board upon the action of stock prices. This controversial 
matter — the actual effect of a change in the rediscount rate — 
has been difficult to appraise satisfactorily. But it seems quite 
possible that equation (8.1) may provide a new and more 
satisfactory approach to the problem. This can be achieved by 
making a comparison between the movement of prices for a 
given observed E (t) when p is set equal to zero, and when it is 
given a positive or a negative value. Unfortunately the scope of 
the present paper precludes an adequate statistical sttidy of 
this engaging problem, but at least its mathematical formulation 
is possible. 
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Let us begin by introducing the transformation of the in- 
dependent variable t to the variable z by means of equation (3.4). 
We thus replace (8.1) by the following: 


( 8 . 2 ) 


t d 2 P dp 


dz % 




where we use the abbreviation: 


(8.3) 


Ei (*) 


■log - 


Z/J 


K = 2 VX/p. 


The solution of equation (8.2) can be written in the form: 

(8.4) p(z) = AJ 0 (z) + BY 0 (z) 

r e i is) 

+ j 2 J Uo(s)Y 0 (z) - J 0 (z)Y 0 (s)] -±1 ds, 

where J 0 (z) and Y 0 (z) are respectively Bessel functions of first 
and second kind of order zero. The quantities A and B are 
arbitrary constants. 

If p = 0, then (8.4) is to be replaced by the following: 

i r* 

(8.5) ^>(/) = A sin a£ + £ cos ai H sin a(t — s)£(s)<fc, 


where J. and 5 are arbitrary constants and a 2 = L As in the 
homogeneous case, there appears to be no obvious way by 
means of which we can pass from (8.4) to (8.5) by a limiting 
process when p -> 0. 

Returning to equation (8.4), we see that it is possible to find 
the price which corresponds to a given impressed force, but 
there are obvious difficulties of integration which make the 
determination of price by this means impractical in general. 
Except for a few cases, which are unrealistic in the present 
problem, the evaluation of the right hand member of (8.4) must 
be achieved by the methods of mechanical integration. 

It should be observed, however, that if an observed price is 
given, the problem of finding the impressed force which generates 
it, is much easier to solve. In this case, we can regard both (8.4) 
and (8.5) as Volterra integral equations, from the inversion of 
which the desired function is obtained. Since, moreover, the 
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differential equivalents of the two equations are known, the 
solution of the problem is readily obtained. 

Thus, the solution of equation (8.1) can be written: 

(8.6) P(t) = P 0 (t) + Q(t), 

where P 0 {t) is the price when E(t) = 0, and Q{t) is the part of 
price which is generated by the force function. If, then, we 
denote by L(u) the operator 

d^u 

(8.7) L(«)^ — + U~ p *u(t), 

equation (8.1) can be written in the form: 

(8.8) E(t) = L[P + Q) = L(P) + L(Q) = L(Q). 

Therefore, given an observed price series, it is possible to 
obtain from (8.8) the market, which would lead to such a series. 
A few examples are given below as follows: 

Case 1. p = P + K, where K is a constant. Eit) = K e~ pt . 
In this case, the market starts with a demand in excess of supply, 
but equality between demand and supply is attained exponential- 
ly. (Figure 4). 

Case 1 

Q(t) E(t) 



Fig. 4 
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Case 2. p = P + A + Bt + Ct 2 , where A, B, and C are 
positive constants. E (t) = 2 C + /. e~ pt {A + Bt Ct*). In this 
case the market starts with a demand in excess of supply, 
which ultimately reaches a point of constant equilibrium in 
which the difference between the quantities demanded and those 
supplied is constant. Prices then increase according to a quadratic 
law and the effect of the interest rate is finally absorbed. (Fi- 
gure 4). 

Case 3. p = P + A + Bt, A, B > 0. E(t) = X e-^A + Bt). 
In this case the market starts with a demand in excess of supply, 
which, after rising to a maximum value, decreases exponentially 
until equality is approximated between the two factors. Prices 
rise linearly and ultimately absorb the effects of the interest 
rate. (Figure 5). 


Case 3 



|Q(t) 


A 

O t 




ol /“t 

Fig. 5 


Case 4. j> = P + A + Bt, A > 0, B < 0. E(t) = A e tp* 
(A + Bt). As in Case 3, the market starts with demand in excess 
of supply, but the demand at once declines exponentially until 
equality is reached. Thereafter, the supply is slightly in excess 
of demand, but the difference diminishes asymptotically to 
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zero. Prices themselves decline linearly until the effect of the 
interest rate is finally absorbed. (Figure 5). 

Case 5. p = P + K + A sin (kt + c), K > 0. E(t) = —A k' 2 
sin (kt + c) + K + A sin (kt + c) e~ pt . In this case the market, 
which starts with a sinusoidal movement about a constant 
positive demand, finally damps into a pure sinusoidal movement 
in which there is at one time an over supply and in another 
excess demand. 

Case 6. (p = 0). p = —(^4 /2a) t cos a£, a 2 = A. £(£) = A 
sin a i. In this case we have an example of the effect of resonance. 
A sinusoidal market, which has the same period as that of the 
free system, generates as explosive price. The bull market of 
1929 is readily explained by this phenomenon, as has been 
indicated earlier in this paper. 

Something akin to this situation is also observed when p is 
positive. Thus if we assume that p — tj 0 (z), where z is defined 
by (3.4), then we find that E(t) = pzj^z), where J x (z) is the 
Bessel function of first order. At t increases z approaches zero 
and J 0 (z) approaches unity. Hence p increases without limit. 
But as z approaches zero, J t (z) also approaches zero and E(t) 
is ultimately zero. We thus have an explosive price with a force 
function that decreases asymptotically to zero. 



PART IV 

On the Edgeworth Taxation Paradox 




WILLIAM VICKREY 


Can Excises Lower Prices? 


In the land of ceteris paribus , that kindergarten of economic 
theory where one thing changes at a time while all else remains 
frozen on the spot as in a game of “redlight," it is a commonplace 
that if a tax is placed upon an article, its price to the consumer 
will be raised, or will at most remain the same. This proposition, 
true enough when “other things" are “equal", in some sense, 
has quite generally been supposed to apply also in the real world 
where “ceteris" are notoriously reluctant to remain “paribus". 
To be sure, it is difficult, offhand, to produce an example from 
real life that can confidently be said to controvert this prop- 
osition. But the complacency with which economists had been 
extending their results from the partial equilibrium world of 
ceteris paribus to the real world of complex interrelations was, 
for the more sophisticated of them, rather rudely shocked when 
Edgeworth, in 1897, managed to produce an apparently reasonable 
and consistent example of a case where a tax on a certain class 
of railway ticket could induce a profit-maximizing monopolist 
to reduce the rate for all classes of tickets, inclusive of tax. 1 ) 

However, the effect of this paradox on economic thinking has 
been considerably less devastating than might have been 
expected. The model of necessity involved considerable mathe- 
matics, and those outside the inner circle of economists versed 
in mathematics were prone to suspect some piece of legerdemain 
concealed in the works. Furthermore the example involved 
monopoly, and at that, a type of monopoly generally subjected 
to public control rather than left to maximize its profit as best 
it might, so that even those few economists who were familiar 

1 ) F. ^T. Edgeworth, “La Teoria Pura del Monopolio,” Giomale degli Eco- 
nomist , Vol. 15 (October, 1897), pp. 307 — 320, esp. p. 317. “The Pure Theory 
of Taxation”, Economic Journal, VII (June, 1897) 231; reprinted in Papers 
Relating to Political Economy (London, 1925), I, 132, 143; II, 93, 401. 
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with the paradox were inclined to treat the example as a curio- 
sum, not likely to have any real life counterpart. The reaction 
of Seligman is perhaps typical: "The mathematics which can 
show that the result of a tax is to cheapen the untaxed as 
well as the taxed commodities will surely be a grateful boon to 
the perplexed and weary secretaries of the Treasury and ministers 
of finance throughout the world !” 2 ) 

It remained for Harold Hotelling to show, in 1932, that the 
phenomenon was also possible under conditions of perfect 
competition; but again the rather formidable looking mathe- 
matics involved tends to scare away most economists. All that 
Hotelling says in non-mathematical terms about the conditions 
that are required, if a tax levied on one of two related commod- 
ities is to lower both their prices, is that the two commodities 
must compete both in production and in consumption. But 
obviously this is not enough of itself to produce the paradox, 
and we cannot say that the phenomenon is likely merely because 
a large number of commodities satisfy these conditions. Some- 
thing more is obviously required. It is the purpose of this article 
to set forth with the aid of as elementary mathematics as is 
possible just what that something more is. 

Unfortunately there seems to be no way to avoid some 
recourse to mathematics. But I will try to come out with 
something more than a mere set of equations. To simplify the 
discussion as much as possible, it will be assumed that we are 
dealing with only the two-commodity case under conditions of 
perfect competition. That is, we will consider only the case 
where the paradox can arise through the interrelations between 
only two commodities, and cases involving longer and more 
complicated chains of relations will be left aside for the moment. 
Also, cases requiring a greater or lesser degree of monopoly 
will be ignored. Moreover, it will further simplify matters to 
assume that the importance of the commodities considered in the 
budgets of individuals is so small that changes in the marginal 

2 ) E. R. A. Seligman, Shifting and Incidence of Taxation (4th ed.; New York, 

1921), p. 214n. m 

3 ) Harold Hotelling, “Edgeworth’s Taxation Paradox and the Nature of 
Demand and Supply Functions”, Journal of Political Economy, XL (1932), 
577—616. 
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utility of money, i.e., the ''income effect” can be neglected and 
that individuals are able to devote more or less of their resources 
to the particular commodities considered without large changes 
in standard of living. 

It can be shown that the existence of the Edgeworth paradox, 
for sufficiently small rates of tax, depends only on the second 
derivatives of the utility and cost functions, which correspond to 
the first derivatives of the demand and supply functions. 
Accordingly we may approximate the utility and cost functions 
in the neighborhood of the equilibrium point (with no taxes) 
by functions of the second degree only. 4 ) 

Accordingly we may write: 

( 1 ) w = a(x — x) 2 + 2b(x — x){y — y) + c{y — y) i 

+ p x (x — x) + p y (y — y) + W 

for the total cost function, W being the total cost of producing 
amounts x and y of the two commodities, the remaining symbols 
being constants. If x and y are the quantities produced under 
equilibrium conditions with no tax, then W is the cost of producing 
this equilibrium output. Under competitive conditions the price 
realized by producers will be equal to marginal cost, so that we 
can put 


( 2 ) 


p x = M x = — = p x + 2 a(x — x) + 2% - y) 
dW . 

p v = M v = - 5 - = Pv + 2b ( x — *) + 2 c[y — y) 

ay 


Thus the constants p x and p y are the equilibrium prices obtaining 
in the absence of tax, justifying the symbol used. It will be 
shown in due course that all of the constants must be positive. 
Indeed for an equilibrium in which sellers assume prices to be 
constant, the marginal cost of x must increase with increasing 
x , and similarly for y , so that the constants a and c are im- 
mediately seen to be positive. A third requisite for the equilib- 
rium to be stable is that ac exceed b 2 . This condition corresponds 
to the increasing marginal rate of substitution, or that the curves 
of constant cost in the x , y , plane be concave towards the origin. 


*) For the completely general treatment the reader is referred to the Hotelling 
article cited above. 
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These conditions are necessary in order that the producer be at 
a point of maximum profit rather than of minimum profit or 
possibly at a minimax or saddle point (which would admit of 
increasing profits by changing output, assuming prices to remain 
constant.) 

Similarly we may write 

(3) U = ~A(x — x) 2 — 2B(x — x)(y — y) — C(y — y) 2 
+ P x (x — x) + P y (y — y)-\-U 

for an aggregate utility function, the utility function of each 
individual being added in with such weight as to make the 
marginal utility of money unity for all persons. With total 
utility thus measured in terms that make the marginal utility 
of money one, we can equate prices to consumers with marginal 
utilities as follows: 

P„ = ^ = U x = P x — 2A(x — x) — 2B(y — y) 

^ dU 

P v = 3 - = U y = P y - 2B(x - x) - 2C(y - y) 
dy 

Again we see that P x and P v turn out to be the prices obtaining 
at the equilibrium consumption x and y . The constants A and C 
must be positive, if the marginal utility of x is to decrease with 
increasing x, and similarly for y. Further, for consumers to be 
in stable equilibrium when confronted with the opportunity 
to buy at prices they consider constant, we must have AC greater 
than B 2 . This corresponds to the diminishing marginal rate of 
substitution for consumers, and insures that the indifference 
curves in the xy plane will be concave away from the origin. 
These conditions are necessary in order that consumers be at a 
point of maximum utility rather than at a minimum or a 
minimax, assuming that prices are to be constant. If these 
conditions are not satisfied it would be possible for consumers 
to improve their position by moving away from the equilibrium 
point in a proper direction. 

Now the price to the consumer will exceed the price realized 
by the seller by the amount of the tax, so that we have 

( 5 ) t X + t X = P x , fiy + ty= Py, = P „ ^ y - P y. 
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Solving these equations for the tax rates, and making use of (2) 
and (4), we have 

/ ft » ^ = p x — px= -2(4 + <*)(* — *)— 2 (5 + b){y - y) 

[ ) t y = P v -p v = -2(5 + &)(*- x) — 2(C + c)(y- y). 

These equations can be solved for % and y in terms of the tax 
rates, giving us 


' 

2 

t x B + b 


ty C + C 

JU U/ — i 

a\ 

A + a B + b 

(7) 

B + h C -j- c 

9 

A + a t x 


B + b c y 



y — y 

a\ 

A + ct b + b 

(8) 

B -j- b C -f- c 


- ^MC + c) -t v (B + b)} 

— TTn for (4 + a ) — t K [B + &)] 


(9) 


where D = 


4 "I - u 5 — |- b 

B + b C + c 


= (4 + a)(C + c)- (5+6)2 


these expressions can in turn be substituted for x and y in equa- 
tions (4), yielding the following expressions for the prices to 
consumers in terms of the taxes: 


„ * 4 

P — P 4 

• • + D 


( 10 ) 

P = P + — 


tx 

B 

+ b 

1 

B 

A 

+ 

a 

t x 

ty 

C 

o 

+ 

D 

B 

+ 

b 

tv 

= 

P+ — 
D 

A(C + c)~ 

•5(5 

t. 

B 

+ b 

1 

c 

A 

+ 

a 

tx 

tv 

C 

+ c 

+ 

D 

B 

+ 

b 

tv 


iy 

D 


= + ^ l c ( A + «)~ B ( B + *)J + | (Sc - bC). 


Consider the effect of imposing a tax on x. If this is to diminish 
the prices of both x and y, we must have, from (10) 

3P 1 8 P 1 

(11) ^=5^( c + c )- b ( s + 6 )]< 0 ; -df x = D {Bc ~ bC)<0 

D, however, can be shown to be positive. For D can be written: 
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(12) D = (AC — B 2 ) + (ac - b 2 ) + (aC + Ac - 2Bb ); 

the first two parentheses are positive by reason of the maximum 
conditions; in the final parenthesis we have: 

(13) aC + Ac > 2VaCAc = 2 V AC Vac > 2Bb, 

so that all three parentheses are positive and thus D is positive. 
Hence the conditions for a tax on x to increase the prices of 
both x and y are: 

(14) A(C + c) - B(B + b) = AC - B 2 + Ac - Bb < 0 

(15) Be — bC < 0. 

To show that b and B are positive, we multiply (14) by C 
(which is positive), and add and subtract B 2 c, getting: 

(16) c{AC-B 2 )+ACc-BCb-B 2 c+B 2 c < 0, 

or ( C+c){AC-B 2 )+B(Bc-bC ) < 0. 

In this last expression the first two parentheses are both positive, 
and the last parenthesis is negative from (15); thus if the whole 
expression is to be negative, B must be positive. If B is positive, 
then from (15) b must also be positive. This means that the two 
commodities x and y compete both in production and consump- 
tion. That is, the more of one that is consumed, the smaller will 
be the marginal utility of the other, and the more of one that is 
produced, the greater will be the marginal cost of producing 
the other. This is the condition verbalized by Hotelling. Com- 
bining the conditions (14) and (15) with the conditions that a, 
b, c, A, B, and C all be positive, and the convexity conditions 

(17) ac ~b 2 > 0; AC — B 2 > 0 

we can obtain the following chain of relationships: 

a ac—b 2 ( b ^ b ^ A t AC—B 2 ^ A 

.... b be c > c > B+ Be > B 
(18 

_ AC ~B 2 B ^ B 
~ ~BC + C > C' 

In this chain, the first relation is an identity, the second iollows 
from (17) (plus the fact that all the coefficients are positive); 
the third follows from (14), the fourth again from (17), the 
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fifth is an identity, and the sixth follows from (17). (15) is 
needed only to show that the coefficients are actually positive, 
and in fact if all coefficients are positive, (15) follows immediately 
from the last four relations in the chain. That is, if a tax on x 
lowers the price of x t and the commodities compete in production 
and in consumption, the tax on x will also lower the price of y . 

It is not enough, however, to produce the paradox, to say that 
the two commodities must compete in consumption and produc- 
tion. To examine these conditions further, we can rewrite the 
utility and cost functions in such a way as to split up the two 
commodities into two independent factors. That is, we can think 
of the production of the commodities as being carried on in, 
say, two independent stages, one stage being common to both 
products, and the other production process being applied to 
onfy one. For example, one can consider as two separate products 
ground coffee (x) and coffee in the bean (y), the growing, curing, 
and roasting of the coffee being one process applied to both 
products and the grinding a supplementary process applied only 
to the x. If these processes are independent in the sense that 
the volume of one process has no effect on the costs involved in 
the other, then we should be able to write W=f{x)+g(y+o:x) ) 
where a is the number of units of x that correspond to a unit 
of y in the common process; in the case of ground and bean 
coffee, this is obviously pound for pound, or unity. In general, 
we find that in order for the processes to be independent, we 
must put 

b 

(19) «-7 

and we can write the cost function as: 

ac-b 2 / . b _ ac-b 2 \ (by 

W= — - — x 2 -\- \p x — —p v —2 — - — x\x+c\y+ — xj 

( b \ 

+ (fi y —2bx—2cy) (y+ — x^+W 0 , 

where •W'o represents the fixed costs at zero output. In general 
the smaller the term (ac — b 2 )jc, the less variable will be the 
marginal cost of the separate process. This corresponds to the 
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fact th at the more constant the marginal cost of the supplemen- 
tary process the more closely competitive the products will be as 
to supply. 

Similar ly, we can decompose the utility function by considering 
that there is a common service that can be rendered indifferently 
by either commodity, but that there is an additional service 
provided by one ( x ) but not by the other (y). Thus we may 
assume for present purposes that either coffee ground at the 
store or coffee ground on the premises will serve to produce a 
satisfactory beverage, but that having the coffee ground at the 
store provides an additional satisfaction in terms of convenience. 
If these two satisfactions are to be independent of each other, 
then B/C must be the amount of the x necessary to substitute 
for one unit of the y in terms of the common satisfaction (note: 
this is not the same as the marginal rate of substitution, as it 
refers only to a part of the total satisfaction derived from the x). 
Thus we rewrite (3) as follows: 

rr (AC—B 2 \ . / B n AC-B 2 \ / B\ 2 

u — {— —r + \ p --c p ’ +2 —c- T- c (' ,+ c a ’) 


( 21 ) 


+ (P y + 2 Bx + 



where U Q represents the level of utility when no x or y is consumed. 
Now we see from (18) that this ratio B/C must be less than the 
ratio b/c obtaining in the common manufacturing process. That 
is, if in the growing and roasting of coffee the equivalence is 
pound for pound, then in the brewing of coffee it is necessary, 
for the Edgeworth phenomenon to occur, that more of the pre- 
ground coffee be required to produce a given number of cups of 
beverage of a given quality than of the bean coffee ground on 
the spot, in this case a not unreasonable hypothesis owing to 
loss of strength, and possibly wastage where the remainder of 
a supply becomes unfit for use. 

Here then emerges a more or less plausible pattern where the 
Edgeworth phenomenon would be likely to occur, under com- 
petitive conditions. One can envisage a situation in which con- 
sumers are on the whole willing to pay a certain premium for 
the convenience of not having to grind coffee at home, but will 
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to a large extent forego this convenience if its cost is much 
increased; under these circumstances a tax on ground coffee, 
raising the cost of this convenience, might lead to a substantial 
shift from ground coffee to bean coffee. But since there is less 
wastage with the bean coffee, the total demand for the growing 
and roasting of coffee may decline, and is particularly likely 
to do so if the demand for coffee as a beverage is inelastic. If 
the supply of coffee is inelastic (as was in fact the case prior 
to the coffee valorization schemes of Brazil) this diversion to a 
more efficient (but less satisfying) method of utilization may well 
reduce the price of the green coffee and with it the price of bean 
coffee to the point where it would more than offset the tax on 
that portion that is still pre-ground. More cups of coffee would 
be consumed, but fewer pounds. 

The essential requirement, then, when the matter is analyzed 
in this way, is that more of the taxed article be required to 
replace one unit of the untaxed article in providing for the 
common element of satisfaction to the consumer than can be 
procured in exchange for one unit of the untaxed article in the 
common production process. Moreover, the difference between 
these substitution ratios must be greater, the less perfect is the 
substitution in consumption between the two commodities, and 
the more elastic is the output of the common production process. 
For we can rewrite (14) as follows: 


( 22 ) 


Bb> AC 
b_ B 

7 > c L 


i + 


B 2 + Ac 
'AC - 5 2 


l AC - B 2 \ ( C\1 

(— bH( 1+ 7)J 


We may take (AC — B 2 )/B 2 as an index of the degree of im- 
perfect substitution, being zero when the two commodities are 
perfect substitutes and infinite when they are completely 
independent; the factor (1 + C/c) reflects the elasticity of demand 
for the untaxed commodity, relative to that of its supply. 

It is not necessary, however, to treat the taxed commodity 
as the one which undergoes a second process or the one which 
provides a second type of satisfaction. The untaxed commodity 
may equally well be regarded in this light. In this case the amount 
of the taxed commodity equivalent to a unit of the untaxed is 
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a/b in the common production process, and A/B in the common 
form of satisfaction; as before, it must require relatively more 
of the taxed article to take the place of a unit of the untaxed 
in giving the common satisfaction than is produced in the com- 
mon process with the resources required to produce one unit 
of the untaxed item. An example of this type might be plain and 
creosoted railroad ties: it may require two or three plain ties 
to give as many years of service as one creosoted tie, and in 
addition the labor of replacement is saved, wdiereas the cost of 
cutting ties may be much the same whether or not the ties are 
later to be treated with preservative in a separate process. A 
tax on the commodity that loses in its relative importance as 
we pass from the common process (cutting) to the common 
service (rail-bearing) (in this case the plain tie), might well 
induce the substitution of more treated ties, decrease the number 
of ties needed for a given amount of service, and if the supply 
of tie lumber is inelastic, might conceivably drive the price of 
tie lumber down by more than the tax. This requires of course 
not only that the supply of tie lumber be inelastic so that a drop 
in demand will cause a substantial drop in price, but also that the 
supply of the treatment process be reasonably elastic so that the 
cost of this processing will not go up too much when the demand 
is increased as a result of the tax. 

In cases of this type we may derive from (18) the following 
expression for the required difference between the ratios: 

_ . a A ac — b 2 AC — B 2 

< 23) If-ip-S bT~ 

Here we see that the required difference becomes larger the less 
perfect the competition between the commodities either in con- 
sumption or in production. 

It would also be possible to have a situation where one of the 
two commodities underwent the extra processing while the other 
conferred the extra benefit. Realistic examples of this are a bit 
hard to find, but possibly a number of food preserving practices 
might come under this heading. The preserving process may be 
considered to reduce spoilage and thus reduce the amount of 
raw produce required to provide a given number of calories to 
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the ultimate consumer, as compared with unpreserved produce. 
Some of the processes appear to be in varying degrees deleterious, 
so that fresh produce has a separate utility attached to it in 
the shape of the absence of these deleterious effects. If consumers 
were adequately aware of these factors and willing to pay a 
premium price for produce not so processed, an Edgeworth 
phenomenon might be obtainable by putting a tax on the non- 
preserved varieties, inducing a shift to the preserved varieties, 
reducing the demand for raw produce, and driving the price 
of raw produce down, possibly by more than the tax. 

There is also the symmetrical possibility of a tax raising the 
price to the buyer by more than the tax, and thus raising the 
price received by sellers for both the taxed and untaxed items. 
Instead of (18), for this case we have, for a tax on y , the con- 
ditions 

a ae—b 2 b b A ac—b 2 A 

— — 1 >_>_| — > — 

b be c c B Be B 

AC-B 2 B B 

— 1 > — . 

BC C C 

The only difference is in the fourth expression, and in the fact 
that the tax is now on y rather than x. 

A possible instance of this sort is the margarine tax: the 
acreage required to produce a pound of butter appears to be 
substantially greater than that required to produce a pound of 
margarine, while the substitution nutritionally is pound for 
pound; if the demand for table fats is very inelastic, the tax 
on margarine might mean that more acreage would be required 
to supply the demand for table fats, and land prices, rural rents, 
fodder prices, and the price of margarine ingredients might all 
be increased as a result of the tax. This is to some extent the same 
configuration that would lead to the possibility of a tax on butter 
reducing the price of both butter and margarine to the consumer. 
The one, however, does not necessarily involve the other, though 
if the commodities are extremely close competitors, it is likely 
that the two phenomena will occur together or not at all. In 
this particular instance, however, it appears likely that the 
demand for table fats is sufficiently elastic and the competition 
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between margarine and butter for the use of land is sufficiently 
indirect that the Edgeworth phenomenon is actually not likely 
to emerge. 

T his by no means exhausts the possibilities, but it does provide 
a few hints as to where to look for the phenomenon and what its 
nature is. In particular, although mathematically it is possible to 
substitute for interdependent commodities properly selected 
“baskets” that are mutually independent (at least in a small 
neighborhood), this mathematical process need not correspond 
to any identifiable “satisfactions” or “processes” as we have 
here assumed, and it would be quite possible for the Edgeworth 
phenomenon to occur without such mathematical breakdowns 
having any obvious real counterpart. Moreover, we have examined 
only the case of two interrelated commodities, and it is clear 
that if we allow for more complex interrelations among several 
commodities additional possibilities may occur. But these simple 
problems may serve to provide some slight further insight into 
the problems of tax incidence. 

None of the foregoing serves to invalidate in any way the 
proposition that from an inclusive viewpoint excise taxes that 
produce divergences between marginal cost and price to the 
consumer impair the general welfare. All that has been shown is 
that under some circumstances the imposition of a tax may 
redound to the net benefit of some of the parties to the taxed 
transaction, in this case, the consumers. The other parties (here, 
the sellers) will in general lose by more than the sum of the 
benefit to the gainers and the revenue to the government. Thus 
unless the sellers are deemed in some way less worthy of con- 
sideration than the buyers, such a tax is still not a desirable 
one from the point of view of the community as a whole, nor can 
the discovery of an instance of Edgeworth’s phenomenon be 
considered necessarily “welcome news to the harassed minister 
of finance.” For we can write for the excess of utilities over 
costs: 

( 25 ) S=V-W 

=B— ( A +a) (x—x) 2 — (B+b) (x—x) (y—y) — (C +c)\y—y ) 2 
This function is thus at a maximum when x — x and y -y- 
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That this is a true maximum and not a minimax is assured by 
the fact that D is positive. 

Even if we were to admit that the buyers for some reason 
should be benefitted at the expense of the sellers, it would in 
general be desirable to levy a direct tax on the rents, income, 
or other surplus of the sellers and pay a direct subsidy to the 
buyers, rather than achieve the redistribution by means of an 
excise tax. To be sure, if some sort of excise is to be used as the 
means of achieving this redistribution, then a tax in an Edge- 
worthian situation would be relatively more effective in achieving 
this redistribution than a tax on an independent commodity 
where the amount of redistribution would necessarily be less 
than the tax revenue. But in general an excise tax cannot be 
justified by such an argument except where the person who is 
to bear the burden would otherwise be beyond the reach of the 
taxing authority. The usual case of this sort is when the seller 
is a foreigner. In the coffee case cited above, the redistribution 
carried out is one in favor of the domestic coffee drinker at the 
expense of the Brazilian coffee grower. But this is in reality 
no more than a peculiarly effective instance of monopsonistic 
exploitation by the taxing country, and unless such action is 
taken as a means of adjusting disequilibria in international trade, 
it could well be considered a form of economic warfare. 

In short it is wrong to dismiss the Edgeworth paradox as 
something that can happen only in rare and peculiar cir- 
cumstances. The phenomenon is not frequent, to be sure, but 
there seems little doubt but that cases can occur and that the 
circumstances tending to produce the phenomenon can be rec- 
ognized if one knows what to look for. On the other hand, 
even when found, instances of the phenomenon are far from the 
“grateful boon” that Seligman thought that they would provide, 
did they exist. At least in domestic matters, existence of an 
Edgeworth phenomenon in no way justifies a departure from 
normal standards of tax policy. It is only where the analysis 
relates to the foreign policy of a nationalistic state, bent on 
deriving the maximum advantage from its strategic position, 
that the discovery and exploitation of the phenomenon becomes 
of concern. 



CHARLES E. FERGUSON 


Modified Edgeworth Phenomena and 
the Nature of Related Commodities 


I. Introduction 

In one of his most celebrated contributions to economic 
theory, 1 ) Professor Hotelling analyzed the so-called Edgeworth 
Taxation Paradox, viz., the proposition that an excise tax imposed 
upon one of two related commodities may result in a decline in 
the price of both. Far from being a mere curiosum as Edgeworth 
suggested, he found that the conditions of the Paradox are satis- 
fied in a wide class of cases conforming to certain theoretical 
requirements which he adduced for demand and supply functions. 
More specifically, Professor Hotelling proved that in both 
competitive 2 ) and monopolistic markets, the Edgeworth phe- 
nomenon is possible if the two commodities are substitutes in 
both consumption and production. 

At this quarter-century anniversary of Professor Hotelling's 
important paper, it seems appropriate to extend his analysis to 
modified Edgeworth cases. In particular, we investigate the 
consumption and production relationships that must prevail if 
an excise tax imposed upon one commodity results (a) in a 
decrease in its price and an increase in the price of the other 
(related) commodity, and (b) in an increase in the price of the 
taxed commodity but a decrease in the price of the related 
product. As a convention, and without loss of generality, 
throughout the paper we assume that the tax in question is 
imposed on good X v 

1 ) Harold Hotelling, “Edgeworth’s Taxation Paradox and the Nature of 
Demand and Supply Functions,” Journal of Political Economy , XL (1932), 
pp. 577—616. 

2 ) “Competition” is used here in the Cournot sense of competition between 
two monopolists. 
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II. Analytical Framework 

Consider two related commodities, X x and X 2) whose demand 
and supply functions are given by 

(1) j> t = /<(*!, x 2 ) and pi = g,fa, x 2 ) (i = 1, 2) 

respectively; or, assuming that the Jacobian of transformation 
does not vanish, by the inverse relations 

= F i(Pv Pa) and x ( = G { (p lt p 2 ) (i = 1, 2). 

We assume that these functions satisfy the theoretical require- 
ments prescribed by Professor Hotelling for the case of unlimited 
budgets. Denoting partial derivation by an additional subscript, 
these requirements are, in terms of equations (l): 3 ) 

(2) fij = f 0 v Sa — Sa> Un ] ^ [g 4j ] 0. 

Furthermore, we adopt the definition of complementarity 

suggested by Professor Hotelling and others. 4 ) Specifically, X x 
and X 2 are competitive or complementary according as 
F 12 = F 21 $ 0. However, since we will use equations in the 
form displayed in (1), it is necessary to express this definition 
of complementarity in terms of the /- and g-functions. Let 

9 fa, x 2 ) 

Pa) ' 

Then obtaining the inverse of the matrix of partial derivatives 
by the adjoint method, we can display the relationship between 
the partial derivatives of the quantities with respect to the prices 
and the partials of the prices with respect to the quantities: 

F 12 == -^21 — —Jfil — Jfl 2> F 22 ~ JflV 

Since we have assumed that the quadratic form is negative 
definite, 5 ) / > 0. Hence the sign of / 12 = / 21 , is the opposite 
of the sign of ^12 — ^21* 

3 ) Hotelling, op. tit., p. 591 and p. 597. More specifically, the two important 
properties are: (a) the “symmetry” or “integrability” conditions; and (b) 
negative (positive) definiteness of the quadratic form of the coefficients of the 
demand Jsupply) functions. 

4 ) Ibid., p. 599. See also Oscar Lange, “Complementarity and Interrelations 
of Shifts in Demand,” Review of Economic Studies , VIII (1940 — 41), 59. 

5 ) This assumption is expressed in relations (2) for the f- and g-functions. 
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Accordingly, X 1 and X 2 are competitive or complementary in 
consumption according as f n % 0. Similarly, by an extension 
of the Hotelling definition, the goods are competitive or com- 
plementary in production according as G n — G 21 ^ 0. Con- 
sequently, X-y and X 2 are also competitive or complementary 
in production according as g 12 = g si 5 0. 

Before proceeding to the analysis of cases, it is convenient to 
establish the equations expressing the change in commodity 
price that is attributable to an excise tax. This is done for both 
competitive and monopolistic markets. 

Let hi — fi — g it so that 

Ki — ha — dhJdXj = dhJdXi, 


and designate the accompanying Jacobian by 

i 3(Aj, h 2 


D = 


d(xi, x 2 ) 


> 0 , 


since the elements of D are the differences between the cor- 
responding elements of a negative definite and a positive definite 
quadratic form. If and are the equilibrium values of price 
and quantity that bring competitive demand and supply into 
equality, then 

(3) # 2 ) == 0 (^ = 1, 2). 


Assume that a small excise tax of t L per unit is imposed upon 
X x and designate the resulting prices and quantities by fi+Ap it 
— Ax t . In this situation, the demand price of the taxed 
commodity must exceed its supply price by the amount of 
the tax. That is, 

(4) h^+Ax^ x 2 +Ax 2 )=zt v h 2 (x 1 +Ax v x 2 +Ax 2 )=0. 

Subtracting equation (3) from equation (4) and applying the 
definition of “derivative”, we obtain 


(5) 


2 

2 h^AXj = d u t v 




(*' = l, 2 ) 


where <5 lt is the Kronecker delta. Equations (5) are simultaneous 
linear equations which are solvable by Cramer’s Rule for Ax t : 


,(6) 



h ^12 
0 h 22 


Ax o — — 
2 D 


h n 

K 


h 

o 
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From equation (1), 

A Pi = lfii Ax r (* = 1 , 2 ) 

Substituting the expressions for Ax t from equation (6), 

k 

dpi == (/ll^22 /l2^2l)> 

dp %= — (/ 21^22 ~ / 22^2l) * 

Finally, substituting the expression / w — g^ for h u and recalling 
that ^ > 0 and Z) > 0, we can write the following expression 
relating the direction of change in price to the imposition of an 
excise tax on the first commodity: 

(7a) sign (J&) = sign (/ n / 22 - / u g 22 - /f 2 + / 12 g 21 ), 

(7b) sign (J^> 2 ) = sign (/ 22 g 21 - / 21 g 22 ). 

Equations (7) express the price changes attributable to an 
excise on X x when the two related commodities are produced 
by competing entrepreneurs. We turn next to the situation that 
prevails when both commodities are produced by a single 
monopolist. As Professor Hotelling has pointed out, in the 
monopoly situation the supply relationship between commodities 
is immaterial. Accordingly, we investigate the consumption 
relationships under monopoly using the convenient fiction of 
costless production. 

After the imposition of an excise tax on the first commodity, 
the monopolist maximizes the resulting profit equation 

Pv^i Pz'X'z k'X'i = ^ 
thus determining quantities so that 

ti-l — t lf 7i 2 = 0, and [n tj ] 0. 

By differentiating the two first-order conditions with respect 
to the tax rate, we obtain 

2 

3**1 

where is a Kronecker delta. These are two linear equations 
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which may be solved for the unknown tax derivatives. Applying 
Cramer’s Rule, 


cofactor of n lj 

x it~~ 


IK] I 


fa- 


il = 1, 2) 


Then using equation (1), the price changes corresponding to 
a tax on X x can be determined: 


2 2 

Pit = 2 fa* a = 2 faVii- 


(* = 1 , 2 ) 


or alternatively: 

(8a) sign (p lt ) = sign (/ ll9 > n + / xa p M ), 

(8b) sign {p„) = sign (/ 2l9 9 u + /asPis)- 

Equations (1), (2), (7), and (8) and the definition of complemen- 
tarity provide all that we need of a purely formal nature. 


III. Modified Edgeworth Phenomena in a Competitive Situation 

Professor Hotelling has shown that a necessary condition for 
the Edgeworth Paradox in a competitive market is that the two 
commodities compete in both consumption and production. 6 ) 
We now investigate the possible commodity relationships for 
modified cases of the Edgeworth Paradox. In doing so, we 
utilize the convenient feature that the possible existence, in any 
specific case, of the modified Edgeworth phenomena can be 
established by providing an example of demand and supply 
functions which meet all requirements of the situation. On the 
other hand, to establish the impossibility of the phenomena 
requires a formal proof. 

A. Case I: Price of X x Lowered, Price of X 2 Raised. 

The first modified case considered is that in which an excise 
tax imposed upon X x results in a decrease in its price, accom- 
panied by an increase in the price of the related commodity X 2 . 
From equations (7), this requires that 

(^ a ) /11/22 — /l2 < /ll§22 /l2?2D 

(^k) /22&L2 > i 12822- 


8 ) Hotelling, op. cit., p. 604. 
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Since, from inequalities (2}, / u / 22 — f\ 2 > 0, / u < 0, and 
g 22 > 0, to satisfy (9a) we must have — / 12 g 21 > 0. This requires 
that either / 12 < 0 or g n < 0, but not both. 

Consequently, if g n < 0, then / 12 > 0. Using this information 
in (9b), together with the knowledge that / 22 < 0 and f 2z g iz > 0, 
we see that /l2§22 > 0. Therefore, it is possible for Case I to 
exist if the two commodities are complementary in both con- 
sumption and production. Indeed, an example of this which 
satisfies all requirements is given by the following set of equa- 
tions: 7 ) 


Demand 


Pi = 11 — + 2z 2 > 

£2 = 30 + 2x x — 5# 2 > 


Supply 


£1=1 + 5#!— 2x 2 , 
£2 = 10 — 2#! + x 2 . 


On the other hand, since the signs of / 12 and g 21 cannot be 
the same, it is impossible to obtain the desired results if the two 
commodities are competitive in consumption and complementary 
in production, or vice versa. Furthermore, Case I is impossible 
if the commodities are substitutes in both consumption and 
production. This situation requires /12 < 0 and g 12 > 0. That 
is, it is necessary to satisfy inequalities (9) subject to the addi- 
tional inequalities: 


fiz < 0, g l2 > 0, / n < 0, / 22 < 0, g n > 0, g 22 > 0, /n / 22 — /i2>0. 


The last inequality above implies 


( 10 ) 


f Ilf 22 ^ 

f 2 ^ 

/ 12 


1 . 


Now since / 22 < 0 and / 12 < 0 by the special hypothesis, the 
second inequality in (9b) may be written ^giai < 1/12^22!* 
This, in turn, implies 


’) The procedure for obtaining examples of Edgeworth or modified Edgeworth 
phenomena is as follows: determine the direction of the inequalities relevant 
for equations (7); then select f if and g if values to satisfy these inequalities. 
The values are the coefficients of in the demand (supply) equations. 

Appropriate constants may be selected by solving the demand and supply 
equations simultaneously. To check the equations, introduce an excise tax on 
one of the commodities. Remembering that for the taxed commodity, demand 
price mmt exceed supply price by the amount of the tax, the demand and 
supply equations, properly adjusted, may be solved simultaneously for 
and % it both expressed in terms of the tax rate. Differentiating the expressions 
for pi with respect to the tax rate then confirms the result. 
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(11) / 22//12 < gzzlgu- 

In light of (10), the inequality in (9a) may be written |/ n g 22 | < 

l/u&il. or 

(12) / 11//12 < gnlSw 
Multiplying (11) by (12), we have 

/ 11/22 &22 S 12 _ , 

/ 2 ^ 

12 Sl2 S 22 

thus contradicting the special hypothesis embodied in inequality 

( 10 ). 

B. Case II: Price of X x Raised, Price of X 2 Lowered. 

The second modification that we investigate is the case in 
which an excise tax levied on the first commodity results in an 
increase in its price but a decrease in the price of the related good. 
From equations (7), this result is possible if 

(I3 a ) /ll/22 ~ /l2 > /ll?22 fnSzv 

(13b) fzzSlZ < /l2§22* 


Inequalities (13) are satisfied by three different sets of com- 
modity relationships. In the first place, if the two goods are 
substitutes both in consumption and production, Case II is 
satisfied by the following set of equations: 


Demand 


— 41 “ 5oi/j ~ 2x 2 , 

p2 = ^1 % X 2> 



Pi — 1 + + ^ x 2> 

p 2 = 1 + 4x x + 2x 2 . 


Secondly, Case II is also possible when the commodities are 
complementary in consumption and competitive in production. 
An example of demand and supply functions satisfying these 
conditions is given by 


Demand 


Pi = 12 — + x 2 , 

^ 2 =11+ — x 2 . 



Pi = 3 + X 1 + X 2 r 

p 2 — 2 + x 1 + 2x 2 . 


Finally, if the two commodities are complements in both con- 
sumption and production, the modified Edgeworth results can 
be obtained. An example meeting these conditions is ^ 


Demand 


Pi = 20 — 5x x + 4x 2 , 
p 2 = 18 + 4x 1 — 4x 2/ 


Supply j 


*pi — 6 'r- Zx x — 3 ce 2 , 
^>2 = 4 — 3 ^ + 4 x 2 . 
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On the other hand, Case II is not possible if the commodities 
are substitutes in consumption and complements in production. 
For if this were so, f lz < 0 and g 12 < 0, implying / 22 g 12 > 0 
and /i 2 g f 22 < 0. But these two inequalities are inconsistent with 
(13b), and, accordingly, the special hypothesis must be rejected. 

IV. Modified Edgeworth Phenomena in a Monopoly Situation 

We now investigate the feasibility of the modified Edgeworth 
phenomena in a monopoly situation, using the same two cases 
discussed in Section III above. As stated previously, Professor 
Hotelling has proved that a tax on one commodity can result 
in a decrease in the price of both if the goods are substitutes 
in consumption, the production relationship being less material 
under monopolistic organization. 

A. Case I. 

From equations (8), an excise tax imposed upon X x will result 
in a decrease in its price and an increase in the price of X 2 if 8 ) 

( 14 a) fuVn + fi 2 <Pi 2 < 0, 

(14b) /ia¥>ii + /sa9>i2 > °- 

Of the two possible relationships, Case I may exist if the com- 
modities are complementary in consumption. This is illustrated 
by the following quadratic demand functions: 9 ) 

8 ) It should be recalled that we have the additional information that 
tii < 0, <Pii < 0, I [/„][ > 0, and |[<p w ]| > 0. 

®) The procedure involved in obtaining examples for a monopolistic market 
can be sketched briefly. Determine the relevant inequalities for equations (8) 
and select a set of values for f i} and <p is which satisfy these inequalities and the 
theoretical requisites of demand functions. Next obtain the three different 
equations for these equations involve x { , f iS and f iik . Using the previously 
assumed values of f tj and postulating equilibrium values for x t , these equations 
may be solved recursively for f ijk . Then postulate two quadratic demand func- 
tions with unknown coefficients and constants. Differentiating these quadratic 
functions twice, we obtain expressions for f ijk in terms of the coefficients of 
the demand functions. Consequently, since values have previously been deter- 
mined for f i3k , one may obtain numerical solutions for the coefficients. Next 
substitute the numerical values into the demand functions, and the latter into 
the profit equation of the monopolist. Then state the first-order conditions 
for maximum profit and substitute the assumed equilibrium values of into 
these equations. The resulting expressions may be solved for the constants 
of the demand equations. The functions thus obtained may be checked by 
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==: lt X l — X 1 X 2 & X 1 "4“ ^*^2 H"" 2 1 
Pi = — i*i + + 3®j - 2a; 2 - 2. 

On the other hand, Case I is impossible if the commodities 
are competitive in consumption. For if so, / 12 < 0. Then with 
f ti < o and <p u < 0, in order to satisfy (14a), we must have 

9 hi > °- But then \fu?n\ < I/ 12 9^12 1 > or 
(15) / 11//12 < fulfil- 

To satisfy (14b) it is necessary that |/ 12 <p u | > I/ 22 or 

(15) / 22//12 < <Pnl<Pu- 

Multiplying (15) by (16), we obtain 

(17) fnhi < fiifn =h 

fl 2 9^11 9^12 

But since f i3 are the coefficients of a negative definite quadratic 
form, 

(18) / 11 / 22//12 > 

Since (17) and (18) are inconsistent, the hypothesis is contra- 
dicted. 

B. Case II. 

In some ways, Case II in the monopoly situation is the sim- 
plest of all cases, for, irrespective of the commodity relationship 
that prevails, this modified Edgeworth phenomenon is possible. 
That is, Case II may occur if the commodities are either com- 
plementary or competitive in consumption. Examples of the 
two situations, in the order designated, are provided by the 
following sets of demand functions: 

i X l X 1 X 2 X \ X 1 ^2 2’> 

l P 2 = —\ X 1 — 2x l X 2 + 2 X 2 + 4 *1 — 5x 2 + 2 ; 

f Pi = 2 X 1 ~ X 1 X 2 ~ i X 2 + #1 + + 2, 

l P2 — 2 X 1 ~ X 1 X 2 “ i X 2 + X 1 + 4 - 

postulating a tax, and then differentiating the first-order conditions with 

respect to the tax rate. Next assume that the tax is an incipient excise and 
substitute the assumed equilibrium values of x { into the n it equations, obtaining 
two simultaneous equations in the unknowns x it . These may be solved and 
substituted into the expressions for f ijt completing the check. 
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Y. Summary 

We are now in a position to summarize the results of this 
investigation. 

A. Competition: 

1. Professor Hotelling has proved that the two related com- 
modities must be competitive in both consumption and produc- 
tion if the prices of both are to be lowered by the imposition 
of an excise tax on one of the goods. 

2. Price of taxed commodity lowered, price of other good 
raised: 

a. this situation can result if the two commodities are com- 
plementary in both consumption and production; 

b. impossible if commodities are substitutes in both con- 
sumption and production, or if they are either complements in 
production and substitutes in consumption or substitutes in 
production and complements in consumption, 

3. Price of taxed good increased, price of other decreased: 

a. possible if the two commodities are substitutes in both 
consumption and production, complements in consumption and 
substitutes in production, or complements in both consumption 
and production; 

b. impossible if the two commodities are substitutes in con- 
sumption and complements in production. 

B. Monopoly: 

1. Professor Hotelling has proved that the price of both 
commodities may be decreased by a tax on one of them provided 
the commodities are substitutes in consumption. 

2. Price of taxed commodity lowered, price of other raised: 

a. possible if commodities are complementary in consumption; 

b. impossible if commodities are substitutes in consumption. 

3. Price of taxed commodity raised, price of other lowered: 
this case is possible irrespective of the consumption relationship 
which prevails between the commodities. 




PART Y 

On Econometric Methods 




RAGNAR FRISCH 


Market Prices vs. Factor Costs and the 
Constancy of Production Coefficients 


The question of whether it is most appropriate to use market 
prices or factor costs in national income statistics and more 
generally in economic model work, has been much discussed. 
I shall offer some comments on this question. Or rather, I shall 
look at this whole question from a somewhat more general 
viewpoint more adaptable for decision model work. Instead 
of factor input as distinguished from indirect taxes (minus 
subsidies), I shall consider the proportional input elements as 
distinguished from the residuum. 

I will try to bring out that in so far as the global national 
product is concerned, the effects of including only the (propor- 
tional) factor costs instead of including the total market values, 
disappear to a large extent when the values are deflated so as to 
bring out the volume figures instead of the value figures. 

In so far as the production coefficients are concerned, the dif- 
ference between alternative kinds of figures is more complicated. 
We have to distinguish between at least four different types of 
magnitudes: strictly physical quantities, volume figures, semi 
volume figures and current values. For each of these types we 
must look into the question of the constancy of the coefficients. 

For practical reasons the usual input output coefficients are 
as a rule computed as ratios between market values observed 
in a base year. This is also done in Norwegian work and in a 
general way I agree to this procedure for the reasons given in 
the sequel. There are, however, special purposes for which some 
modifications may be used. Compare the comments below on 

the method followed in the Oslo median model. 

<% 

A. The Strictly Physical Structure 
To bring out the essence of the problem as it appears in a 
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decision model, let us first consider a table with all final deliveries 
aggregated, and with only strictly physical quantities involved 
so that no vertical su m mations are possible. The result of such 
a set up is given in Table 1. The strictly physical quantities 
are denoted by small letters. 


Table 1 

Input-Output of Strictly Physical Quantities 



Receiving sector 
No. 

Final 

delivery 

Total 

delivery 

h = 1 

2 

Delivering 

k = 1 

0 

X 12 

^ i * 


sector No. 

2 

*21 

0 

^2* 

*2 


Labour 


w 2 

— 

— 

Primary 






input 

Non compe- 






titive imports 

h 

h 

— 

— 


If we do not impose any other relations than the definitions 
of the total products, i.e., 

(1) *12 + *1* = *1 

*21 * 2 * ^ *2 

we have 10 — 2 = 8 degrees of freedom. 

If we introduce the 6 production coefficients by 

/ / 

/y. /yi /yt /y» />• /y* 

x 12 — **12 *91 — ^21^1 

(2) w 1 = w[x x w 2 = W 2 x 2 

b% = = ^ 2*2 

and for the moment consider all these coefficients as variables, 
we have 16 variables and 8 equations, hence still 8 degrees of 
freedom. As basis variables we can choose for instance the 6 
coefficients and x v x 2 . Or we can choose the 6 coefficients and 
# 2 *- The 8 basis equations are in the first case (2) together with 

(3) x lx = X 1 *12 *2 * 2 ® = * 21*1 + *2* 

In the second case they are 

*i* + *12*2* *21 *1* "t” *2* 

- - x 2 — — 7 

1 *12*21 1 *12*21 


W 
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together with the 6 expressions obtained by inserting (4) into (2). 

This is the structure of the system expressed in strictly 
physical terms. The case of constant coefficients is covered 
simply by putting the coefficients in the basis equations equal 
to their constant values. This leaves us with 2 degrees of freedom, 
which may, for instance, be unfolded by x lt x 2 or by a^*, x^. 

B. The Structure Expressed in Volume Figures 

Now let us introduce a set of product prices n v jt 2 and factor 
prices n w , n b . We call them standard prices. Let us see how the 
various constellations of the system which are physically possible 
with the degrees of freedom in (1), can be expressed in the value 
terms derived from the standard prices. We also introduce 
residual items %, e 2 in the two production sectors. The residual 
items may be the sum of taxes T h and net profits (savings) 6 h . 
For practical purposes in a decision model these residual items 
are very important, but their introduction causes considerable 
complications in the definitional set up. These difficulties we 
must consider in a systematic way. 

The new figures are listed in Table 2. We could, if we wanted 
to, introduce different wage rates in the two sectors and also 
different import prices, but that is unessential in the present 
connection. 

Table 2 


Interflow Table of Values Reckoned at Standard Prices and with Standard Residuum 

Elements 




Receiving sector 
No. 

h = 1 2 

Final 

delivery 

Total 

delivery 

Delivering 

k = 1 

0 

**1 X 12 



sector No. 

2 

71% #21 

0 

**2 a? 2 * 

7Z 2 *2 

Primary input 

Labour 

Non compe- 

Tlfif W x 7t w W 2 


0 


titive im- 
ports 

71 ^ Tib ^2 

-nt(\+b s ) 

0 

Residual' 11 input 


*1 


-(«i+«s) 

0 

Grand total 



7t 2 X 2 

0 

i TCj 7l 2 
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The prices 7 t x and are actual prices per physical unit, say 
per kilogram or per kWh. The wage rate n w is also reckoned 
per physical unit, say per hour of work. Similarly for n b . The 
residual input is measured in money. 

It should be understood that Tables 1 and 2 exist simul- 
taneously, and that the actual physical quantities in the two 
tables are the same. 

In Table 2 we have imposed the condition that the sum in 
column No. h shall be equal to n h x h . This is equivalent with 
defining the residual e h when the prices are given. Or we may 
inversely consider the condition as defining the prices n h in 
terms of the residual inputs e x , s 2 . We will most of the time 
adopt the latter viewpoint. 

The column sum conditions are expressed by 

^ 2^21 + ™v> W \ + ^ h + h = 

*^T^T2 ""H 2 -f" *4” ^2 ^ ^2 X 2‘ 

We define the residual rates ^ and e 2 by 

(6) e k — s k x k (A = 1, 2). 

If need be, these residual rates will be called direct residual 
rates to distinguish them from certain aggregate residual rates 
to be considered later. 

This gives 4 equations in addition to the 8 we had before. 
The additional variables are 

Number of 
variables 

4 TTj, TZty Wy)) ^1) 

(7) 2 s v s 2 

2 e v e 2 

8 Total 


In other words we have 8 — 4 = 4 more degrees of freedom 
than in the table of strictly physical quantities. As additional 
basis variables we choose the factor prices n wi n h and the jesidual 
rates s v e 2 . Using x v x 2 rather than x l!js and as basis variables, 
the total set of basis variables will be 



PRICES VS. COSTS AND PRODUCTION COEFFICIENTS 


195 


tty)* fi'b 

(8) fiy 

x 12’ ^21 > ^l* ®2> ^i> ^2* 

If all the 6 coefficients listed on the last row in (8) are taken 
as given, there are 6 basis variable left, namely those on the 
first three rows of (8). Of these x lt x,, determine the physical 
constellation of the system — all the other physical features 
following from x 1 , x 2 — while n w , n b , e v s 2 determine the price 
features — the other prices following from n b , e v s 2 . 

The prices n x and as functions of the coefficients and the 
basis price elements, are determined by inserting into (5) from 
(2) and (6) which gives the system of two equations 

7t x ^2*^21 = "t" rC b b x H" ®1 

" ^'1*^12 l ^2 I* ^6^2 I ^2* 

The solution of this is 

n w {w'i + w 2 x 21 ) + XbiK + ^2^2i) + (gi + e 2 x 21 ) 

1 — ^12^21 

n w( W l X l2 + W 2) + ^b(W X 12 + ^ 2 )+ (^ 1^12 + ^ 2 ) 

1 — x 12 x 21 

In the case of n sectors (9) has the form 

(11) = 7i w w' h + n h b' n + a h [h = 1, 2 ... n) 
which is solved by 

( 12 ) n h = 2 frwK + n b b' h + h) (<5 — x ')nl (k = \, 2,. . . n). 

»- i 

The formulae (10) show that if not only the factor prices, 
but also the direct residual rates are constant, the product 
prices will also be constant. In other words the whole structure 
of standard prices will be fixed. We consider them as base 
prices and take the corresponding values as defining volume 
figures. 

We put 
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(13) 

Xlc — 7*k x k 

— ?Ck x k* 

(14) 

X kh = 


(15) 

§ 

£ 

l) 

B h = 7t b b h . 


The volume figures defined by (13) — (15) are entered in 
Table 3. That is to say, if the numerical figures are entered, 
Table 2 and Table 3 will be exactly the same. 

Keeping the price structure — as defined through n m , n b , e v 
s 2 — constant and varying x x and x 2 , we get different constella- 
tions of the volume figures. Since it is simply a question of units 
of measurement to pass from the system of strictly physical 
quantities to the volume figures, we can just as well think of 
X x and X z as varying under constant n w , n h , s x , s z . 


Table 3 

Interflow Table of Volume Figures Reckoned under Base Year Prices and Base 
Year Residual Elements 




Receiving sector 
No. 

Final 

deliveries 

Total 

deliveries 



h = 1 

2 

Delivering 

k = 1 

0 

X* 

Xv* 

X x 

sector No. 

2 

X 21 

0 

x„ 

X t 


Labour 

w. 

W, 

-(W 1+ w 2 ) 

0 

Primary input 

Non compe- 
titive im- 
ports 

B i 

B , 

~(B l +B i ) 

0 

Residual input 


£ 1 



0 

'Grand total 


Xi 

X, 

0 

X x +X a 


The introduction of the volume figures, i.e. the magnitudes 
denoted by capital letters in (13) — (15), does not change the 
number of degrees of freedom because to each new magnitude 
corresponds one definitional equation. If we assume that in 
(8) not only x x and x 2 , but also s x and e 2 are changing while the 
factor prices n w and n h as well as the 6 physical quantity coef- 
ficients are given, we have 4 degrees of freedom. (In ^sectors 
2 n degrees of freedom). From the discussion in the sequel it will 
.appear that we will reserve the terminology volume figures, as 
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defined through (13) — (15), to the case of constant residual 
rates. These constant rates we can assume as observed by the 
actual situation in a base year. Now there remain only two 
degrees of freedom in the volume figures. They may be unfolded 
say by X x and X 2 . (In n sectors n degrees of freedom). 

Table 3 has at the same time the following two properties: 
(1) the magnitudes entering into it have the character of volume 
figures (because they represent values at base year prices), 
and (2) vertical summations in the table are possible. 

I shall look a little closer into the particular aspect of the 
question that is represented by the constancy of the residual 
rates. 


C. Constant Factor Prices and Constant Residual Rates Entail 
Constant Input-Output Volume Coefficients. 

We define 
(16) 

(17) w ’^ir B ' h = Y' 

From the definitions (13) — (15) follows that the coefficients 
(16) — (17) are equal to 

(18) 

n il 


(19) 


tt r/ ^ W f 

n h 


B' 



where the n k are given by (12), and n w , are the given factor 
prices. If all the production coefficients x 12 , x' 21 , w' v w 2 etc. 
reckoned in strictly physical quantities are constant, we see from 
(10) and (16) — (17) that constant factor prices and constant 
residual rates s k entail constant X’ kh , W' h and B' h . 

The conditions of constant residual rates £ x and i 2 can be 
transformed into corresponding conditions about the residual 
rates expressed as fractions of X x and X t . Indeed, from (6) 
and (13), we get 
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( 20 ) 

where 

( 21 ) 


e k — 


k 




% 


The last formula taken in conjunction with (10) shows im- 
mediately that if we have constant factor prices and constant 
quantity coefficients, constant e k rates will entail constant s k 
rates. 

On the other hand, if the marginal rates e k are given, instead 
of the s k) we deduce from (9) by inserting from (21) 

^ 22 ) n 2 X 21 ^ n w W l + 

From the equations (22) the n k are determined. That is to say: 
Constant coefficients in the strictly physical structure, constant 
factor prices and constant residual rates e k entail constant 
prices n k and hence by (21) constant s k . 

It is the same set of prices n 1} n 2 that is determined from (9) 
and (22) only the data are taken in a slightly different form. 
The generalization to n sectors is obvious. 

This means that if the residual rates e [ and e 2 as defined by 
(20) are constant, we can reason about the volume figures in 
Table 3 very much in the same way as we can about the strictly 
physical quantities. 

To be more precise: In Table 3 there are 12 variables connected 
by the 12 equations 


55=8 ^21 + + B 1 + e 1 = X n + X u 

X 2 = X u + W 2 + B 2 + s 2 = X u + 

and (16) — (17) and (20) where the coefficients X kh) W' h , B' h 
and e h are constants (and hence may be determined by observing 
the content of Table 3 in a base year). The first two equations 
in (23) reduce to conditions on the coefficients. Hence two 
degrees of freedom in the variables. This checks with the remarks 
in connection with Table 3. 

The two degrees of freedom that remain in Table 3 under the 
conditions specified — constant physical coefficients,, constant 
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factor prices and constant residual rates — may be generated 
by letting X ± and X 2 vary. Or we may use X lJie andX 2 * as basis 
variables and use other equations to express X x and X 2 . 

D. Constant Coefficients in the Volume Sense Entail Constant 
Residual Rates 

If we assume a model of Table 3 where (16) — (17) hold 
with constant coefficients, we can conclude that the residual 
rates e k defined by (20) are constants. 

Indeed, introducing into the left hand equations in (23) — 
which follow from the balancing principles of Table 3 — the 
expressions for X h> X kh) W h) B h from (16) — (17), we get 

, 24 , h=l- (Ki + W' 1 + Bi) 

1 ; 4 = 1- (^2 + w’ t + B'). 

Hence: If X' kh , W' h , B' h are constants, s[ and e 2 must also be 
constants. We are thus back to the same type of analysis as was 
discussed under subsection C. 

Having reduced in this way the whole formulation to the figures 
contained in Table 3, we may drop the assumption of an under- 
lying strictly physical structure which we started from, and 
simply reason about the figures of Table 3 as value figures 
reckoned at base year prices. This formulation will apply even 
though there is a great variety of individual goods that enter 
into each aggregate X k or X kh etc. For all practical purposes these 
figures could be interpreted as volume indices. And it would 
seem plausible in many cases to make the assumption of constant 
input-output coefficients reckoned in such figures. 

If we take the volume figures as the basis of the analysis, the 
product prices n lf n 2 become indetermined, and the same is true 
of the factor prices n wi n b . Indeed, if in (22) we insert for x kn 
from (18), and similarly use (19) we simply get back to (24). 
The product prices and factor prices can now simply be looked 
upon as conventional multipliers by which we define “the 
strictly physical quantities" in (13) — (15). If the “strictly 
physical quantities" are well defined and observable, we can, 
of course, deduce the factor prices n w> n b and product prices 
n lt n 2 that must prevail in order that we shall get the observed 
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volume figures (in base prices year prices) X lf X 2 , W lt W 2 etc. 


E. The Aggregate Residual Rates 

The residual rates s ' h express the input of residual substance 
that is made directly into sector h } reckoned per unit of total 
output X h from sector h . We can also consider the aggregate 
residual rates e h defined by 

(25) e h = that part of X h which is due to the input of residual 
substance in any sector, assuming that all residual 
substance is everywhere passed on to other sectors or 
to final output in the same proportion as the volume of 
cross deliveries or the final deliveries. In other words 
all units of output from a given sector contains the same 
amount of residual substance. 

When the aggregate residual rates are defined in this way, 
they must satisfy the equations 


(26) 


£ 1 £ 2 ^21 


t 



— £iZi 2 + £ 2 


The first equation in (26) is obtained by noticing that the 
total outflow of residual substance from sector 1 is e[ X v This 
must be equal to the total inflow of residual substance into 
sector 1, namely the residual substance entered directly into 
sector 1 — this is e[X x — plus the residual substance that is 
entered into sector 1 through X 2l — this is 4-^21 — • Dividing 
this equality by X v we get the first equation (26). Similarly 
for the second equation in (26). 

The inputs of labour W h and imports B h axe not to be entered 
in the above account as they are by definition not residual 
elements. But we could have singled out, say W k , and considered 
the direct coefficient W' h as distinct from the aggregate coefficient 
W\. The reasoning would be the same as in (26). 

The generalization of (26) to n sectors is obvious, namely 

(27) £ e k (8 - X') kh = 4 (h — l, 2, . . . n). ' 


The solution of this is 
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(28) e k = ^e' h (d-X')-l (k = 1, 2, . . . »). 

7i=l 

If we take the volume figure coefficients X' kh etc. as data , 
the meaning of the matrix in (27) is clear. If on the other hand 
we go back for a moment to the interpretation in terms of strictly 
physical quantities and with constant physical coefficients and 
constant factor prices, we must remember that the volume 
figure coefficients X' kh in (27) depend on the e' h . The volume 
figure coefficients X kh will indeed in this case have to be looked 
upon as determined by (18) where the n k are given by (22), 
and hence depend on the e' h . This means that if we fall back on 
the constancy of strictly physical coefficients, we cannot deter- 
mine the e k for different e h by retaining the left member matrix 
in (27) and just changing the right member vector e' h . Both 
the matrix and the vector will have to be changed. On the other 
hand, if volume figure coefficients are taken as given, we cannot 
change the e h but must let these magnitudes be determined by 
(24). The equations (26) have therefore no use for determining 
the residual rates. The only purpose of the equations is to pass 
from the direct rates e' h to the aggregate rates s h or vice versa. 

It is a fundamental proposition in input-output theory that 
equations of the form (26) will have non negative solutions 
s h if the e h are non negative. 

If we multiply the first equations in (26) by X t and the second 
by X 2 and add the equations, we get, using the equality between 
the left and right members of (23) 

(29a) + e 2 -^ 2 * = e i ”1“ e 2 * 

That is to say the total residual substance contained in the final 
delivery is equal to the total residual substance put into the 
system. 

Since the s n are non negative (and at least one of them positive 
if at least one of the s h are positive), we see that the total residual 
substance contained in the sectors products must be larger than 
the total residual substance put into the system, i.e. 

(29b) s^X-^ ~j— S-^X 2 "f“ ^ 2 * 

This double counting which prima facia appears a little puzzling, 
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is easily explained: The global product X ± -f~ X 2 has itself 
emerged after some double counting. In X 1 + X 2 is indeed 
included not only the total primary and direct residual input, 
but also all crossdeliveries. This follows by taking the sum of 
the left hand equations in (23), which gives 

(30) X x +X % = (W 1 +W 2 ) + (B 1 +B 2 ) + (e 1 + s 2 ) + (X l2 + X^ 

That is to say the sum of all sector products will increase if we 
split the sectors further up. 

The double counting is only in the total sector products, not 
in the aggregate residual rates e lf e 2 as is seen from (29a). 

For clear thinking in the variety of situations that arise ac- 
cording to the various systems of assumptions adopted it is 
essential to be very careful in the notation. It is indeed safe to 
be so explicit as nearly to appear pedantic. 

We will from now on let the symbols used in subsections C and 
D, i.e. X k , X kh) X kh etc. and the corresponding coefficients be 
strictly interpreted as the volume figures and volume figure 
coefficients, that appear when the residual rates e' h are constants 
and have a specific set of values. 

These volume figures themselves are recorded in Table 3 and 
the corresponding coefficients are defined in (16) — (17). With 
given and constant coefficients the degrees of freedom in this 
model is, as already stated, equal to n , the number of sections, 
i.e., in Table 3 it is equal to 2. 

F. The Structure in Semi-Volume Figures 

An essentially new situation arises if we drop the assumption 
of constant residual rates. We can discuss this situation by going 
back to the structure expressed in strictly physical terms. We 
assume constant technical coefficients in this physical structure 
and also constant factor prices, but the direct residual rates may 
now be changing. And as they change, they will produce changing 
product prices and hence changing value figures. These value 
figures we will term the semi-volume figures. In this way of 
thinking there are 2n degrees of freedom, represented, ^y, by 
the n physical quantities and the n residual rates. 

Instead of discussing the semi-volume structure by the help 
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of the strictly physical quantities, we can also do it through 
the residual rates and the volume figures as they were defined 
under subheadings B — E, see in particular the comments in 
the last part <5f subsection E. 

All the semi-volume magnitudes will be denoted by the super- 
script sem (standing for semi- volume). 

The new situation will be described by a table similar to Table 2 
namely by Table 4 and it is through the balancing equations of 
this new table that the product price concept gets a meaning. 

Using an interpretation in terms of the strictly physical 
structure, we are particularly interested in the connection between 
the 7t£ m and sf m and ef m . 

We put up the following definitions, which are similar to 


(13)- (15). 

(31) 

•xr = 

(32) 

ysem — ^sem 

— n k x Jeh 

(33) 

TT/sem _sem psem _sem l 

= n w w h = 7t 6 * 0 h . 

The semi-volume figures Xf™ etc. measure the production 

levels, the cross deliveries etc. when the residual rates are 
chosen as e k em instead of the e' k that are associated with the 
measurement of the volume figures X k , X kh etc. 

In general 

we will assume 

(34) 

Tvsem sem __ 

n w — 7Z W TZjj — 7€ b 


but for the symmetry of the formulae we may retain the notation 
n s * m and nl em . Instead of Table 3 we now get Table 4. 

It should be understood that Tables 2, 3 and 4 — as well as 
a table similar to Table 2 with sem added as superscript on the 
7i and s — exist at the same time. 

In the complete system now considered we again have 2 n 
degrees of freedom which may be unfolded by, say, the X k 
and the In the case of 2 sectors, there will be 4 degrees 
of freedom. In the strictly physical system we also had 4 degrees 
of freedom, when the factor prices n w and as well as the 6 
production coefficients in (8) were given. 

If we lean on the interpretation in strictly physical quantities. 
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it is easy to indicate what the semi-volume figures will be in 
terms of the volume figures X k , X kh etc. 


Table 4 » 

Interflow Table of Semi-Volume Figures Reckoned under the Prices that Prevail 
when Factor Prices are Constant and Residual Rates are Arbitrarily Given 



Receiving sector 1 
No. 

Final 

deliveries 

Total 

deliveries 

h = 1 

2 

Delivering 
sector No. 

h = 1 

2 

0 

v sem 

A 21 

v sem 

A 12 

0 

■trsem 

A l* 

yS61H 

*^2* 

^.sem 

T^sem 

A 2 

Primary Labour 

input Non compe- 

titive im- 
ports 

jpsem 

B sem 

£ sem 

rySem 

-(Bf m +B| em ) 

0 

0 

Residual input 

sem 

H 


0 

Grand total 

v-sem 

A 1 

^sem 

A 2 

0 



Indeed, adding the superscript sem for the price elements in 
(22) (except for the factor prices, which are the same) and using 
(31) — (33), we get 


(35) j>r (i - - < m 4 . = + % k 

2 + < m (i - 4““) = « + 

where 


(36) 

Similarly we get 


c 'sem 



(37) Trf” — nf m z' 21 = n w w k + n b b'i + 

- = n w w' z + nX + e? m 

where 


(38) 

i.e. 


o sem 

E k — 


_sem 

e k 

X k 
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(39) «; sem = — . 

We may loot upon the two sets of prices n k and simply 
as special values assumed by the product price functions for 
different values of the residual rates considered as arguments 
in these functions. 

Through (13)— (15) and (31) — (34) we get, remembering 
that the strictly physical quantities x k , x hk etc. are independent 
of how residual rates are chosen 



the p k m are index numbers of prices, with the base situation 
chosen as the situation in relation to which the volume figures 
are defined. These index numbers have a meaning even if no 
strictly physical quantities are defined. For p k m = 1 the semi- 
volume figures are equal to the volume figures. 

The number 2 n of degrees of freedom is not changed by in- 
troducing the semi- volume figures through (31) — (34) or by 
introducing the price indices through (46). Indeed, to each 
new magnitude introduced corresponds a definitional equation. 

To unfold the 2 n degrees of freedom, we may use the volume 
figures X k and the residual rates 4““ defined by (36) (the e k are 
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fixed as mentioned above, see in particular the comments to 
(13) — (15)). Instead we may use as basis variables the X k and 
the fil em - Or the and the X k . Or some other linearly in- 
dependent set of 2 n of the variables entering into the complete 
set up. 

From (35) we get by (46), (18) and (19) 

(47) tf“(l - < $em ) ~ firX'n = K + B[ 

- PTX'u + pro- - 4 sem ) = K + K 

The coefficient X' lt , X’ 21 etc. in (47) are determined by (16) — 
(17) applied in the base year situation. The fir are therefore 
well defined as functions of the 4““. The generalization to n 
sectors is obvious. 

We could also have considered the semi-volume residual rates 
in the form 


(48) 


X* 


By (36) and (40) this is the same as 
(49) «g“ = 

With the e k m given (47) takes the form 

(so) pr - prx't i = w[ + is; + 6-f m 

-i>rx' 12 + pr = w % + b' + §r. 

Note the analogy — and also the difference — between the 
equations (47) and (50) on one hand and on the other the 
equations (22) and (9), and also the equations (35) and (37). 

For e“ m = 4 we should by (46) get fir = fir = 1. That 
this is in fact so, is seen by inserting these values for the fir 
and comparing with (24). 

The solution of (50) is 


(51) 




Wi + W' t X' KL + (B[ + B’ 2 X’r + (ar + srX'n) 


i - x' a x' n 


P% 


(W'X' 12 + W' 2 ) + (BjX' u + B' t ) + (C m ^2 + C m ) 


Again the generalization to n sectors is obvious. Instead of 
(51) we get 
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(52) 2Pr(*--X') kh =W' h + B' h + er (*=1,2...*). 
The solution of this is 

(53) pr = i (W’ h + B' h + s-n (d - X')£ (k = l, 2 ... n). 

7i=l 

The last formula suggests immediately the following three 
component parts of the price p s k m 

n 

W k = XW' h (d-X') $ due to labour input anywhere in 

A=1 the system. 

n 

(54) = 2 Bj(<5— X')^ due to imports anywhere in the 

ft=1 system. 

n 

e -aaa _ ^ £* em (<5— due to residual input anywhere 
ft=1 in the system. 

The last expression in (54) is aggregate residual substance in X k 
reckoned per unit of X k . It satisfies the equations 

(55) ££*” - e'i em X' n = ef m 

'sem j£'sem _r *sem __ -sem 

C 1 x 12 l c 2 °2 

where as before £^ em is direct residual input reckoned per unit 
of X k . These equations are analogous to (26). In both cases the 
residual substance is reckoned per unit of the sector product 
measured in volume figures. In n sectors (55) is written 

( 56 ) | eT(6 - X') kA = er (h = l, 2... n). 


G. The Semi-Volume Coefficients and a Modified Definition of 
the Sector Products 

In analogy with (16) — (17) let semi-volume coefficients be 
defined by 


(57) 


v'sem 

Y^sem 

^ kh V'sem 

A h 


w, 


W'^ = x , 


h jgf sem __ 


lx 

xr - 


(58) 
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Note in this connection (42). 

Inserting from (43) — (44) into (57) — (58), we get 


(59) 

(60) 


v-'sem 

^kh 


PT 

pr 


X kh i* e * 


V'sem 

A Tch ““ 


PTX™ 
pr* » 


W’i 


: -h W' h and B?* 

^pstm n n 




This shows that if the volume coefficients X' kh are constant, 
the semi-volume coefficients cannot be constants under changes 
in residual rates, because, such changes will by (51) make the 
price indices p k m change. There would, of course, be no logical 
inconsistency in assuming the semi-volume coefficients constant, 
but then the volume coefficients would change under the changes 
in residual rates. 

The real question at issue is to know which is the most 
realistic assumption. 

If we assume such a market organization and such a techno- 
logical structure that an increase in the price of a product will 
cause an equally large relative decline in its use for cross delivery 
as well as for total delivery, then the semi-volume coefficient 
would be constant while the volume coefficient would change. 
This is seen from (59) — (60). : 1 ) 

But if we can assume fixed coefficients in the strictly physical 
structure, then the volume coefficients must be constant. In 
what follows I will assume constant volume coefficients. 

Then a second question arises: Can we modify the definition 
of sector product in such a way as to compensate for the varia- 
bility in semi- volume coefficients? 

An obvious answer is that if the volume coefficients X kh are 
known and also the residual rates — either in the form e^ m or 
in the form s k m — the price indices p k m will follow by (47) or 
(50). Hence we can always by (43) — (44) compute “compensated” 
variables — namely the X h and the X kh — which are such that 
they will be connected by the constant volume coefficients. 
This procedure is however highly non linear and it does not seem 


l ) How realistic such a case would be, is another question. 



PRICES VS. COSTS AND PRODUCTION COEFFICIENTS 


209 


very promising to proceed to a study of the semi-volume 
variables along such lines. 2 ) 

Starting from the concepts of semi- volume figures it is, however, 
possible to introduce certain modified definitions of sector 
products and cross deliveries which are such that they are 
approximately related through the constant volume coefficients. 

To arrive at such a formulation we will first rewrite the 
expressions (51) in the forms 


( 61 ) 


pr = 


tr= 


(fj^ — g l)+( g 2 em ~ 4)^21 

i - 

+ (gr-«2) 
l-X^X^ 


The first of these equations follows by writing the numerator 
in the first equation of (51) in the form 

(62) +X , a (W' 2 + B 2 +s' 2 ) 

+te" n -*D+x; i(C m -4)- 

The first and second parenthesis here are respectively (1— X' 21 ) 
and (1 — X[ 2 ) by (24). This part of (62) therefore becomes 
(1 — X 21 ) + ^ 1(1 ~ Xiz) = 1 — X' 12 X 21 . This establishes the 
first equation in (61). Similarly for the second equation in (61). 

As a check on (61) we see that p* m and p 2 m reduce to 1 if 
e 2 m — and s| em = 4- 

In the case of n sectors, we have 


(63) pr = 1+1 WT - 4 ) (-5 - *')» (k = 1 , 2 . . . n). 

h=l 

In the regular case the coefficient of (sl em — e^) in the first 
equation of (61), namely 1/1 — X' 12 X 21 (* n general: the diagonal 
element (<5 — X ') jJ) will be slightly above unity, while the 
coefficient of (e| em — e 2 ) in the first equation of (61) will be 
small since it is multiplied by the coefficient X 21 , and may 
therefore be neglected in a first approximation. Similarly in 
the second equation in (61). That is, we have 

(64) ^ p s * m = l+fi| em — e* (approximately) {k = 1, 2 . . . n). 

2 ) The prices are by (50) linear in the e| em , but by (47) non linear in the 
e£ sem . In any case the deflation by the prices makes the set up non linear. 
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In other words, as a first approximation the price p k m depends 
only on the residual rate i k and not on the other residual rates. 
And the relation is a simple addition. The "dependency” we 
speak of now is an (approximate) accounting dependency which 
hold good regardless of behaviouristic relations. 1 ) 

Multiplying (68) by X k , we get 

(65) X k m —e k m = X 1c (l—e' k ) (approximately) (k = 1, 2, 

The input-output coefficient in semi volume figures, i.e. 
X^ em as defined by (57) is not constant. There is, as is seen from 
the left hand expression in (59) a correction to be applied in the 
numerator as well as in the denominator in order to reach 
something that is constant. The correction in the denominator 
can be done with the approximation (65) simply by using the 
left hand expression in (65) to measure the sector product 
instead of Z| em . 

We will first consider the case where we make this denominator 
correction without making the numerator correction. Is this a 
sound procedure? 

By analogy consider the difference (x 1 — x 2 ) between two 
stochastic variables. The variance of this difference will be 
equal to var. x x + var. x 2 — 2 rV var. x x • var. x 2 where r is 
the correlation coefficient. This expression is larger than var. x 1 
if, and only if V var. # 2 /var. > 2 r. Therefore, if we know 

that var. x 2 is appreciably larger than var. x v it will pay to 
correct x 2 — that is making it non stochastic — even if we do 
not correct x v And this will apply regardless of the nature of 
the correlation, whether positive or negative. 

In our case the question is if we shall correct for p s * m in the 
denominator of the expression to the right in (59) even if we do 
not correct for p k m in the numerator. We know that a change 
in p k m will produce a change in pf m in the same direction (positive 
correlation), but the change in p s k m will be proportionally much 
smaller if there are many highly intertwined sectors. Hence we 

*) It is the equation itself, i.e. (64) — or more exactly (61) — which has 
accounting character. This, of course, does not prevent one or more of the 
variables from entering into some other relations that are behaviouristic. The 
expressions "accounting” vs. “behaviouristic” can be used about a relation, 
not about a variable. 
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ought to get a more correct result by correcting for even 
if we do not do it for pf m . 

The above argument is particularly adapted to the case where 
there is a change in the residual rate in a single sector. To some 
extent a similar reasoning can be applied in succession to any 
of the sectors. In each step the correction contemplated will be 
better than nothing. But it is quite clear that if all residuals 
change simultaneously, there may occur cases where it would 
have been better to make no corrections at all in the variables. 

For instance if all 4 sem are equal — i.e. 4 sem independent of 
k — we see from (72) that we obtain a better approximation 
by not making any corrections on the variables, because in this 
case X s m is a constant times X™ m . The constant is equal to 
(1 - 4/1 - e')X' h . 

On the other hand if (£ fc sem — 4) is independent of k , and hence 
by (64), pf m independent of k , we see from (57) — (59) that 
Xk™ is again a constant times X h , but now the constant is simply 
Kk- 

These cases where the semi volumes figures themselves are 
connected by constant coefficients are, however, very special. 
They resemble the case where the residual rates are constant 
and we get the volume figures. 

If we want an approximation that holds — at least roughly 
— for any changes in residual rates — in particular for changes 
with a small covariance between the individual residual rates — 
the correction of the denominator to the right in (59) — which 
leads to (68) — seems to be a workable formula. 

The correction for p% m can be achieved simply by starting 
from the exact relation 

( 66 ) Xf? = PrKnXn 

and introducing here the expression for X h taken from (65). 
This gives 

(67) X%? = fr I" r— (*r - 4 em ) (approximately). 
•» L1 — 


Dropping at this stage the correction pf m , we can write 
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(68) = [rzj} (^r - 4 em ) (approximately). 

The expression in bracket is a constant and can be determined 
from the data in the base year. (If the sector product has been 
defined as (X h — e h ) already in the base year where the coef- 
ficients were determined, the value of the bracket will emerge 
directly.) 

This procedure, while rough has the great advantage that it 
keeps the model linear, and it will as a rule — compare the 
discussion above — at least be better than simply to put 

(69) X%? = X' kh Xr (incorrect) 

in a case where the residual rates do not remain constant. 

It is possible to make the first order correction also for the 
factor p k in the numerator to the right in (59) but then the 
model does not remain linear. Indeed we have 

( 70 ) Pt m = ~ (exactly). 

■A-k 


Introducing here for X k from (65), we get 


(71) 


fit 


jgsem 


ysem e sem 

b k 


(1 — e k ) (approximately). 


And inserting this in (67), we get 1 ) 

x s * m ni — £)x f n 

/f7o\ V'sem * I V kJ kh ( v'sem „sem\ 

\ 1 *) ^kh ~~ vrsem i ) 

-A-jc Gjc L 1 


(approximately) . 


H. The Formulation in Non-Residual Cost 

Let us introduce the aggregate non-residual fart of the price 
•of the output from sector k. This is that part of p s k m which is 
■due to the input of primary factors (in Table 4) labour and 
imports). This part of f k m is given as the first two terms in 
(54) which — in the case n = 2 — are given by the terms in 

*) Dividing by the first fraction in (72), the left member becomes — £* sem ) 

If this is taken as definition of a corrected cross delivery, we get a relation with 
•constant coefficient. But this relation is not linear in and ej em . 
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(51) that do not depend on and e| em . 

We denote this part 

(73) tr = i(W' A + B' h ) {d - X')~t (*=1,2 

ft = l 

As long as the production coefficients reckoned in volume 
figures are constant, the prices (73) are constant. The correspond- 
ing non residual parts of the sector products, cross deliveries 
and final deliveries are 

(74) xr = pi° d x k xif = firx» 

(75) X™ = prx kh 

(76) wr = w h B™ = B h . 

Since these non residual parts of the volume figures are simply 
proportional to the volume figures, nothing is gained by working 
with these variables instead of the volume figures. Both sets of 
variables will exactly satisfy relations with constant coefficients, 
provided the volume structure coefficients are constant. No 
further attention will therefore be paid to the structure in non 
residual parts. 

I. Formulation in Factor Costs 

Finally we will consider a breakdown of sf m in the two parts 

(77) e% m = (5“ m + T k m {k = 1, 2. . . . n) 

where <5* m stands for profits and T k m for taxes. More specifically 
we may interpret <5“ m as profits before the deduction of direct 
taxes, so that Tf m will stand for indirect taxes. 

We have now n more degrees of freedom, i.e. 3 n degrees 
altogether. They may be represented, say, by the X k , and the 
rates e“ m and T k m , where is defined by (48) and 

ssem 'j'sem 

(78) 3r=-£- Tr=-~ 

■X-lc 

so that 

f sem = ^sem + fsern 

Assuming that the coefficients in the volume structure are 
constant, we know that if the are changed, the p k m must 
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follow in the way previously discussed. This, however, does not 
say anything about the way in which the prices will change if 
the T k m change. Conceivably any change in the T k m may be 
compensated by opposite changes in the d k m so that the ef m 
remain constant and hence the constant. Or a smaller or 
larger part of the change in may be absorbed in the prices. 

In a market of a more or less conventional sort it is perhaps 
plausible to assume, as a very simple case, that the Tf m will 
affect the prices directly and fully in the sense that we have 

(79) p s k m = 1 + T k m — T' k (approximately) (k = 1, 2 . . . n) 
where 


(80) 




h 

x k 


= indirect tax rate in the base year. 


If this is so, we get by a reasoning analogous to that connected 
with (64) — (65). 

(81) Xf m — 7| em = X*(l — T' k ) (approximately) 

k = 1, 2 . . . n). 

so that in analogy with (71) — (72) we may put 

(82) X& = [^^7] (XT - (approximately). 

The expression in brackets is a constant to which we may 
attach comments similar to those connected with (68). 

The set up (81) has been used in the Oslo median model. 1 ) 


J. Formulation in current values 

If we consider also the factor prices as variables, we are led 
to the concepts Xf 1 = p k X k , X“ r = p k X kh , etc. Note that 
p k m stands for the price concepts that emerge when the factor 
prices, i.e. the price concepts for W k and B k , are constant, while 
f k stand for the corresponding concepts when the wage rate and 
the import prices may change. I.e. that emerge when the factor 
price concept is expressed by a general wage index q (with 

l ) The Oslo median model contained several specifications that go beyond 
those considered here, but in essence we can say that what is here denoted 
(js^sem _ j'semj an d respectively, was there denoted X h and 
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q = 1 in the base situation) and the import prices are arbitrarily 
given. Correspondingly the X£ ur and the X£" are current values 
as they emerge when applying the general price indices fi k . 

When the current values are deflated, we get back to the 
volume figures. For any individual sector product this deflation 
is simply 


(83) 


defl. X“ r = 


xr 


x ‘’ = x. 


Pr. ind. (Y;") p. 
For the global output as a whole we have 


(84) 


defl. (X\ ar + X™) 


Xf 


xr 


\T) 


Pr. ind. (X c x ur + X' 

If we use a Laspeyre price index, (84) is further reduced to 


(85) 


"V'ciir i vcur 

defi. (xr + xn = - ~ Zj y =x 1 + x 2 . 

Pl^l + P 2^2 

X x + X 2 


A similar reduction takes place if we deflate the semi volume 
figures. That is to say, in order to arrive at a measurement of 
the global output that is independent of such price effects as 
may be produced by residual input elements, or more specifically 
that part of these elements which is represented by indirect 
taxes, it is not necessary to use measurements as (XJ?" — £™ r ) 
or (X™ r — T£“) for the sector products (compare by analogy 
(65) and (81)). We can use the total value X| em and afterwards 
deflate. 

Another aspect of this question is that such differences as 
(X™ r — ef**) or (X™ r — Tff*) only represent first order correc- 
tions. The complete correction is obtained by computing the 
prices p t , p 2 by formulae analogous to (51) or (53) — or ob- 
serving them — and then using these prices for the deflation 
process. 



LAWRENCE R. KLEIN 


The Efficiency of Estimation in Econo- 
metric Models 


At an early stage in the development of mathematical statistics 
and in its application to economics Professor Hotelling steered 
our thinking in the right direction towards the relationship 
between problems of parameter estimation from samples and 
problems of prediction outside the confines of the sample. The 
essence of the matter is contained in his 1929 paper written 
jointly with H. Working. 1 ) A more concise presentation was 
given later in Hotelling's appraisal of a research publication 
in psychology and sociology. 2 ) Hotelling's formulation of the 
prediction problem will long serve as a model in econometrics. 
In following up recent contributions on the efficiency of alter- 
native methods of estimation in econometrics, it is found im- 
mediately useful to turn to Hotelling's basic formula for the 
standard error of forecast. 

First I shall take up some problems in connection with the 
use of conventional least-squares methods of estimation in 
econometrics and then go on more specifically to the problems 
of prediction which rely so heavily on Hotelling's inspirational 
work. 

A Revival of Support for Least-Squares Estimates 

Recent arguments in favor of reversion to conventional ap- 
plication of the method of least squares for estimating para- 
meters in single equations in simultaneous systems dealt with in 
econometrics have stressed the comparative efficiency of this 

l ) H. Hotelling and H. Working, “Applications of the Theory of Error 
to the Interpretation of Trends/ 1 Journal of the American Statistical Association, 
Vol. 24 (March Supplement, 1929), pp. 73 — 85. 

a ) H. Hotelling, “Problems of Prediction/ 1 The American Journal of Sociology 
XLVIII (1942), 61—76. 
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method. For example, in the discussions at the 1954 meetings 
of the Econometric Society in Uppsala, H. Theil outlined a 
theorem showing that the generalized variance of least-squares 
estimates of thfe parameters in a single equation is at least as 
small as that of limited-information-maximum-likelihood 
estimates. 3 ) It has long been suggested that it rnay be rewar din g 
to gain some efficiency at the expense of bias by using the least- 
squares method, but the formal proof of the superior efficiency 
has only just been set out by Theil. Of course, we must know 
more about the “utility function” of the users of estimated 
models before we can judge about the relative importance of 
bias and efficiency. 

In this paper, I should like to suggest that Theirs results and, 
in fact, much of the discussion on the relative merits of different 
methods is misplaced. In systems of equations, with several 
parameters in each equation, it is misleading to look at in- 
dividual parameters, one by one, or even restricted groups of 
them in reaching overall judgments about the importance of 
bias or efficiency. Users of econometric models are often not 
really interested in particular structural parameters by them- 
selves. They are interested in the solution to the system, under 
alternative sets of conditions. In other, more technical, words 
they are interested in the reduced forms of the estimated 
system. 4 ) It is the difference between 'partial and general analysis 
that is involved. It is conceivable that partial analysis is an end, 
in itself, for some problems — possibly those of a purely peda- 
gogical nature — but most problems call for a more complete 
analysis of the system. 

The transformation of a structural system to its reduced form 
can be associated with the process of forecasting. We shall use 

8 ) “Report of the Uppsala Meeting, August 2 — 4, 1954,” Econometrica , 
Vol. 23, (April, 1955), pp. 204 — 5. Limited Information, and other methods of 
estimation in econometric models are described in various publications. A good 
summary is found in Wm. C. Hood and T. C. Koopmans, “The Estimation of 
Simultaneous Linear Economic Relationships,” Studies in Econometric Method , 
(New York: John Wiley and Sons, 1953). 

4 ) Theil points out to me that for treatment of structural change and ap- 
praisal of the a priori “reasonableness” of parameter estimates one would be 
primarily interested in the individual structural estimates. 
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this term in a general sense in this paper, that is in the sense 
of making estimates of endogenous economic magnitudes outside 
the realm of past experience. In this sense, a wide variety of 
problems of empirical economic analysis are forecasting problems. 

Propositions valid for partial systems may not carry over when 
complete systems are studied. Or even propositions valid for 
structural parameters may not be valid for reduced form 'para- 
meters . A beautiful property of the maximum-likelihood method 
of estimation is that its characteristics are preserved under 
single-valued transformation of variables. Let 0 be an unknown 

A 

parameter, 6 its maximum-likelihood estimate, and f{0) a 
single-valued transformation function. Then it follows that the 

A 

maximum-likelihood estimate of f(0) is given by /(0). As a 
result of this proposition, the desirable features of maximum 
likelihood estimates of structural parameters remain as desirable 
features when the parameters and estimates are transformed into 
reduced form coefficients. An analogous property does not hold 
for the method of least squares in general. 

Bias-Structural Equations and Reduced Forms 

First, let us consider the question of bias. By comparing one- 
by-one the coefficients of a system estimated by the method of 
least squares with corresponding coefficients estimated by some 
unbiased method, investigators sometimes conclude superficially 
that the amounts of bias are unimportant. From studies of 
numerical methods of solving linear equation systems with 
parameters subject to error, we learn, however, that small 
errors in coefficients may lead to sizeable errors in the final 
solution. 

Suppose that we have a linear equation system 

(1) By t + rz t = u t , y jointly dependent, 

z predetermined, 

with reduced form 

(2) y t = — J5" 1 n t + B- 1 u t . 

We shall denote a set of unbiased or consistent estimates as 

. A 

B and JT. Least-squares estimates will then be written as 
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B + D(B ), f+D(r). 

D(B) and D(/) are discrepancy matrices showing how the least- 

-v A 

squares estimates differ from the set B , /. For a given £ r vector 
we shall then be interested in the effect of the discrepancies on 
the solution vector. 

A first order approximation to the solution of 

[B + + D(y t )] + [f + D(r)]z t = 0 

is given by 

(3) D(y t ) = -B-'lD(r)z t + D(B)$ t ), 

where 

Except for the possibility that errors in the /'-matrix, /(/), 
compensate errors in the B-matrix, B(B), these original errors 
will be reflected in errors in the vector, D (y t ) after multiplication 
by j 8" 1 . In some problems Z? -1 can be a large factor, as will be 
illustrated below in a simple example. 

Consider, for illustrative purposes, the simple multiplier model 

(4) C t = <x.Y t + u t> 

(5) Y t = C t + I t . 

C t = consumption (endogenous) 

Y t = income (endogenous) 

I t = investment (exogenous) 
u t == random disturbance. 

The unbiased estimate of the marginal propensity to consume 
is a , and the biased least-squares estimate is a + e. 5 ) The bias, 
when looked at from a partial point of view is simply e, but when 
considered from the more general point of view of the whole 
system it is e/l — a — e, as can be seen from 

®) Haavelmo derives an explicit expression for the bias in this model and 
finds i? positive under fairly geneial conditions. T. Haavelmo. '‘Methods of 
Measuring the Marginal Propensity to Consume/' Journal of the American 
Statistical Association $ Vol. 42 (1947), pp. 105 — 122. 
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(6) 

where 


C t = (a + e)(C t + I t ) 

c = a+e ^ i = r * i 

* l—a—e 1 Ll— a (1— #)(1 — a— e) 




In the reduced form, or multiplier, equation the bias is mag- 
nified by the factor 1/1 — a — e y which will be of the order of 
magnitude of the multiplier. 6 ) Another way of looking of the 
matter is to observe that the percentage bias is smaller in 
absolute value in the structural equation than in the reduced 
form equation. 


e e 

a a( 1 — a — e) 


as long as |1 — a — e\ < 1. 


In this simple case, it is clearly seen that a discrepancy in 
the structural equation e y is magnified by the multiplier, 
1/1 — a — e. The multiplier plays the role of B~ x in the more 
general formulation (3), showing the possibility of comparatively 
small discrepancies becoming comparatively large in the final 
result. 


Least-Squares Efficiency — Structural Equation and Reduced Form 

This simple model of the multiplier process is useful in providing 
an example to show that Theirs proposition about the efficiency 
of least-squares methods cannot be extended to reduced form 
parameters. The efficiency properties of least-squares estimation 
of the parameter in the structural equation are not preserved 
under transformation to the reduced form, just as we know that 
the point values of least-squares estimates vary with the direction 
of minimization of squared residuals. 

The variance of the direct least-squares estimate of the marginal 

6 ) Haavelmo, op. cit., shows numerically how a difference of .06 in the 
marginal propensity to consume becomes a difference of .68 in the multiplier. 
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propensity to consume is given by 


var (a + e) = var 


ZC t Y t 

ZY* 


27(« Y t + u t )Y t 
ZY* 


= var 



Zu t Y A 
£Y* ) 


— var 


Zu t Y t 


Applying a first order approximation formula of a function 
of random variables to the right hand expression we get the 
classical type result. 7 ) 


( 7 ) 


var [a + e) — 


var (u) 
ZY* 


in the limit as the sample size -► oo. In deriving this result the 
standard independence assumptions are used about u t and Y t 
for unequal subscript values. Thus for large samples the classical 
formula is used even though u t and Y t are not independent (for 
equal subscript values). 

In forecasting Y t from estimates of the reduced form equation 


r*- 



1 


1 — a 


u t , 


we are interested in the variance of the estimated multiplier 

1 


est. 


1 — a 

The least-squares estimate of the structural equation leads to 
an estimate of the multiplier as 


l.s.est. 


1 


1 — cl 1 — a — e 
while the corresponding unbiased estimate will be written as 

1 1 


m.l.est. 


(m.l. = maximum likelihood). 


1 — a 1 — a 

We approximate the variance of the least-squares estimate as 


7 ) This is, of course, the same result that Theil derives by a different method 
in his ’comparison of the efficiency of least-squares and limited-information 
estimates. He deals with the estimated variance determined from the sample 
observations. 
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( 8 ) 


1 var (a + e ) 

VaX 1 - a- e = (1 - a - «)* = 

(- ) var (u)(ZY?) 

\1 — a — el 

(ZY\-ZC t Y t YZY\ 



(i - « 7 


var (w) ZY* 


The maximum-likelihood estimator is simply that multiplier 
value calculated from the least-squares regression of Y t on I t , 



With a positive bias, e > 0, we have the inequality 


var u 

(T=^ 


^ plim 


var (u) 

(I — a ~ e ) 2 


since plim a = a, 

and we assume 0<1 — a — e< 1. 
Also 

1 . ZYl 


hence 


var 


ZP t - (II t Y t ) 2 ’ 

(nb) Splimvar (rr 



Thus our example shows clearly that the efficiency properties 
of least-squares structural estimates do not always carry over 
to the reduced form equations. This example is, of course, 
special as well as simple. The maximum-likelihood estimate 
of the reduced form equation is simultaneously, or can be trans- 
formed into, a least-squares estimate of the reduced form, a 
full-information-maximum-likelihood estimate of the whole 
system, and a limited-information-maximum-likelihood estimate 
of the consumption function. All three of these identical esthnates 
are therefore superior to least-squares structural estimates, in a 
sense, for this model. 
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If a model is exactly identified, there exists a set of least- 
squares estimates that provide efficient estimates of the reduced 
forms. These are also maximum-likelihood estimates of the 
reduced form and can be transformed into maximum-likelihood 
estimates of the structural parameters. They are, however, very 
particular least-squares estimates and not those that are custom- 
arily obtained from the separate treatment of each structural 
equation. The least-squares estimates of the structural equation, 
in the form usually made, do not in general preserve their effi- 
ciency properties under transformation to reduced form para- 
meters. To put this in concrete terms of the simple model 
considered in this section, we note that the least-squares estimate 
of the marginal propensity to consume does not provide an 
efficient estimate of the multiplier (although a least-squares 
estimate of the multiplier would, in fact, be efficient). In over- 
identified systems, the lack of invariance of the efficiency 
property under transformation proves to be more important. 

In the next section, we shall compare efficiency of least- 
squares and full- and limited-information-maximum-likelihood 
estimates of the reduced forms, in more general cases, including 
overidentification. In another section, however, we shall present 
some results of a sampling experiment with small samples and 
overindentification, which show clearly the failure of least- 
squares estimates to retain efficiency under transformation. 

The Efficiency of A Priori Restrictions 

It has frequently been remarked that by imposing a priori 
restrictions on an economic model we gain efficiency of estimation 
because more information is brought to bear on the problem 
than is the case in the absence of such restrictions. 8 ) It is the 
purpose of this section to give a formal proof of this point of 
view and to interpret it. 

Liu has posed the seemingly paradoxical proposition: Least- 
squares estimates of the reduced form parameters in linear 
systems, ignoring all a priori restrictions, lead to a higher point 
on the likelihood function, assuming normally distributed disturb- 


8 ) See e.g., T. C. Koopmans and Wm. C. Hood, op. cit., esp. p. 176. 
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ances, than do restricted maximum-likelihood estimates; 
therefore the unrestricted estimates are to be preferred. 9 ) Put 
in another way, he observes that squared discrepancies between 
predicted and actual values of endogenous variables in a linear 
model will be smaller over the sample period if calculated from 
(unrestricted) least-squares estimates of the reduced form 
equations than if calculated from any set of estimates of structural 
parameters using a priori restrictions. 10 ) If the least-squares 
values of the reduced forms give better “predictions” over the 
sample period, should we not expect them to give better predic- 
tions outside the sample? 

While the least-squares estimates of the reduced forms are 
consistent and while they lead to minimal squared residuals; 
they do not lead to efficient estimates of the reduced form 
parameters. We shall now turn to the proof of the proposition 
that the more one uses valid information in the form of a priori 
restrictions imposed on the system, the more efficient are the 
estimates. 

Let us write a linear model as 

(1) By t + n t = u t , 
with reduced form 

(2) y t = -B-'rz t + B-iu t . 

y t , z t and u t are column vectors with n, m, and n components, 
respectively. B is a square, nonsingular, matrix of order nxn, 
while r is rectangular of order nxm. 

We can also write the reduced form equation as 

(10) y t = nz t + v t , 

obscuring the relation between its coefficients and those of the 

9 ) T. C. Liu, "A Simple Forecasting Model for the U. S. Economy/* Inter - 
national Monetary Fund Staff Papers, IV (August, 1955) 464 — 66, Liu*s ar- 
gument is actually more in terms of a supposed inherent tendency towards 
underidentification in models of an economic system than in teims of the 
point reached on the likelihood function with and without restriction. He uses 
the latter point, however, in justification of his claim of lack of identifiability. 
His general conclusion is that we can do no better than to make unrestricted 
least-squares estimates of the reduced forms. 

10 ) In the case of exact identification, least squares estimates of the reduced 
form parameters will coincide with fully restricted maximum likelihood estimates. 
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structural equations. On multiplying both sides of this equation 
by B, however, we find 

( 11 ) Bn = -r, 

by equating coefficients of like variables in the reduced form and 
structural equations. Some elements of F are zero; therefore 
BII has the same zero elements. These are the restrictions. 
Let us write, symbolically, 

( 12 ) (Bn), = 0 

to show that r elements of F are zero, and that corresponding 
elements of Bn are zero. 

The logarithm of the likelihood function of the entire system 
can be written as 

T T 

(13) L = const. — — log |jQ„| + A'(S/7) r . 


Q v = matrix of variances and covariance of elements of v t . 
X is a vector of Lagrange multipliers with r non-zero elements. 
If there were no restrictions on the system, variances and 
covariances of est. II are given by 


8 2 L - 1 -i 

) —r-ir = ° i,M « = ■ 

07l ik OJCj l 

a ij = typical element of X?" 1 . 
o {i = typical element of Q v . 

M xz — moment matrix of predetermined variables, z t . 


The elements of this inverse matrix (14) are the variances 
and covariances of unrestricted least squares estimates of il. 
For any particular equation in this complete set of reduced 
forms, the appropriate variance-covariance matrix is a u M~ x . 
To determine the variance of forecast for any endogenous 
variable, outside the sample values, we have, by Hotelling's 
formulation of the problem of prediction error, 

(15) S% = a«(l + z' p Mj x z p ), 

where z p denotes a vector of values assigned to predetermined 
variables in the forecast period. 
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Suppose now that we have r restrictions on the maximization 
of the likelihood function, then the variances and covariances 
of est. II are given by the N.W. principal minor. A, on the 
right hand side of the following expression r 


(16) 

d 2 L | 

dftijc dttji j 

A 

-1 

A 

A 


A' 

0 




A' 

0 


The bordering matrix A consists of elements of A obtained as 
—d 2 Lldn ij db kl , where b kl are elements of B. 

Theorem: Consider an mxm principal minor of A correspond- 
ing to the i- th reduced form equation. Denote this principal 
minor by A ip . 

It follows that 

z' F A {p z F ^ a u z' f M~}z p . 

Proof: Denote a vector of mn elements by w. 


(17) 

Let det. 

d 2 L 

dn ik djiji 

A 



x 

0 


where A is a vector. 
Form the difference 


mn mn 

2 A m . i} w t w j 2 A u w t w, 

ft __ UZI . 

A oo A 

A oo is formed from A by deleting the last row and column. 
Aoo.u is formed from A 00 by deleting the i-th row and /-th column. 

mn tnn 

A 2 Aoo.is w i w i ~ A oo 2 AijWiWj 

S = Mzl 

AAqq 

By Jacobi’s theorem on determinants 11 ) we have 

u ) See e.g., A. C. Aitken, Determinants and Matrices (London: Oliver and 
Boyd, 1942), pp. 98 — 99. 
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— ^00 ^ is 


therefore 


(18) 


mn 

2 A w A w w x w, 

1,3=1 


( mn \ 2 


AA 


oo 


AA 


00 


The difference, <5, is positive since A 00 is a positive definite quad- 
ratic form. Determinants of positive definite quadratic forms 
obtained by successive rows and columns of bordering alternate 
in sign; hence 

^l^oo ^ 0* 


Thus by bordering 
(19) det. 


d 2 L 

djt ik djij z 


— ^00 


with one row and column, we find 

(20) w' (A^ qqW <: w'iA^^w. 


By bordering A with a row and column to form ( n A ) , we similarly 
find 

( 21 ) ®'[(n^) -1 ]oo.oo^ ^ w'iA-^ooW, 

and so on for successive bor derings. By letting all elements of w 
vanish except the m elements in z F corresponding to the Mx.m 
principal minor of A associated with i-th reduced form equation, 
we have 

(22) z'pA^Zp-g. a H z w M~l z F , 


as was to be proved. 

The total variance of forecast from a system of structural 
equations is composed of two factors, the variance of disturbances 
a u and a quadratic form (plus unity) in the assumed values of 
the predetermined variables. The matrix of this quadratic form 
when multiplied by a u is the variance-covariance matrix of 
reduced form parameter estimates. By adding information to 
the5 system in the form of restrictions on the parameters, we 
decrease the magnitude of this quadratic form. The estimated 
value of forecast error will reflect differences in estimation 
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procedure, the calculated variance of residuals being lowest 
when no restrictions are used. Nevertheless, the underlying or 
inherent error using the true value of a u in the formula for 
prediction error will be smaller, the more one Employs a -priori 
restrictions in calculating the structural characteristics. 

The succession of inequalities derived shows that full-infor- 
mation-maximum-likelihood estimates will be more efficient 
than limited-information-maximum-likelihood estimates since 
more restrictions are used with the former than with the latter 
set. Both of these methods, in turn, will be more efficient than 
unrestricted least-squares estimates of the reduced forms. 

In connection with limited-information estimates a word of 
explanation is in order. When any single equation in a system 
is being estimated, this method will produce a set of reduced 
form estimates, separately involving each of the endogenous 
variables in that equation. In so far as a single endogenous 
variable appears in several different structural equations, there 
will be several possible reduced form equations that could serve 
as alternative forecasting equations. The best among these will 
be that set which yields the smallest quadratic form 

z F A ix z F . 

This will be inferior to the full-information-maximum-likelihood 
estimates. Another way of using limited-information estimates 
in practical forecasting, is by solving algebraically for reduced 
form equations in a structural system, each equation of which 
has been estimated by the method of limited information. 12 ) 

The foregoing results on the efficiency of the use of a priori 
information follow closely a development by Samuelson called 
“the generalized Le Chatelier principle.” 18 ) Samuelson shows 
that the sensitivity of an economic variable to a parameter 
change, e.g., price elasticity of demand, decreases as the number 
of restraints imposed upon the system increases. He proves 
this proposition by bordering a matrix of second derivatives of 
a function being maximized. In a sense, this paper extends his 

la ) L. R. Klein and A. S. Goldberger, An Econometric Model of the United 
States , 1929 — 1952 (Amsterdam: North-Holland Publishing Co., 1955). 

1S ) P. A. Samuelson, Foundations of Economic Analysis (Cambridge: Harvard 
•University Press, 1947), pp. 36 — 39). 
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result from inequalities on diagonal elements of inverse bordered 
matrices to quadratic forms of principal minors. 14 ) If we were 
interested solely in variances of individual coefficients in the 
reduced form, Ms result would be directly applicable. Since the 
forecast error involves an entire quadratic form, his proposition 
must be extended. It may also be remarked that Samuelson's 
proposition is not a perfect analogue for ours. His bordered 
matrices are those familiar in establishing conditions for ex- 
tremes, while ours would have to be bordered further to take 
that form. 

Some Experimental Sampling Results 

The argument so far has been based on asymptotic theory. 
In a constructed experiment with repeated small samples, 
G. W. Ladd has produced some extremely interesting findings 
relevant to the problems of this paper. 15 ) His model consists 

of two overidentified (demand and supply) equations. 

(23) y lt = 0 lt y„ + y n z lt + y 12 z 2t + u lt 

Vlt — PziVto + Yz3 Z 3t 4 " Y2i Z 4t 4 " U 2f 
He assigned specific population values to the structural coef- 
ficients and drew random numbers for the disturbances and the 
exogenous variables. The endogenous variables, y lt and y 2t , 
were then computed from the structural equations in (23). 
Ladd was particularly interested in the effects of errors of 
observation; therefore he added to each of the endogenous and 
exogenous variables an error of observation. 

Thirty samples of thirty observations each were obtained by 
drawing the six sets of observation errors repeatedly, while 
the disturbances u iti are not changed from sample to sample. 
The errors in y lt are, together with linear functions of u it> values 
of reduced form disturbances; thus the model differs from that 
treated above by having observation errors in the other variables. 
For each sample, Ladd computed from the observations on the 

14 ) Samuelson’s demonstration in the first printing contains some minor 
compensating errors, but the final results are quite correct. 

is) g. w. Ladd, “Effects of Shocks and Errors in Estimation: An Empirical 
Comparison,” Journal of Farm Economics, XXXVIII (1956), pp. 485 — 95. 
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y * s and z’s, estimates of the structural parameters by the method 
of least squares and the method of limited information. 16 ) From 
each of these two types of estimates we can derive, alge- 
braically, reduced form estimates in each sample. 

If we denote least-squares estimates by a ^ and limited 
information estimates by a a sign, we can derive two possible 
reduced forms for each sample, 


Vi t — 7n ^ + 
/a A \ ~ P 12 

( 24 ) —72 3 


ft 


12 


P 22 ft: 


ftn - ft 


8 ~~ z 3 t — 7 24 


d 2,1 + ( 

ft 12 


1 + 


Pl2 


P 22 ““ ft 12 ' 


‘ 2 1 


12 


ft 22 -ftl 2 


Zf2t — 3 3 z lt 3 o 3 R R R 

r 22 r 12 P22”*Pl2 P22~~Pl2 P22 — Pl2 

or the same equations with y n replaced by y n , y 12 by y 12 , etc. 

Alternatively, without taking any of the identifying restric- 
tions into account, we can postulate general reduced form 
equations 


(25) ^ ^ llZlt ^12 2 2i + 'PlS^t + ^14% 

V2t ~ p21 Z lt + p22 Z 2t + p2Z Z 3t + ft2± Z ±t 
whose coefficients are estimated from the least-squares regres- 
sions of y lt and y 2t on z lti z 2t , z 3t and z 4t . These regressions are 
obtained as a by- product of the limited-information estimates. 

The accompanying table gives the results of our computations 
from Ladd's data. For the three methods of estimating the 
reduced forms, I have calculated the mean and standard devia- 
tion of the thirty sample values of each parameter. The com- 
putations in the table assume, in effect, that there are thirty 
independent samples; however, the fact that the vectors 
(u iv u i2 , . . . % 30 ) are unchanged in repeated samples, means 
that the composite reduced form disturbance, consisting of a 
linear function of u it and observation error, contains a common 
element in each sample. Since the variance of this linear function 
of u it is not small relative to the variance of observation error 
there is significant correlation in the random terms from sample 


u ) In the least-squares regressions, y lt is selected as the dependent variable. 
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to sample. In comparing the different methods of estimation 
we do not have as much information as would be given by 
thirty independent samples. 


Table 

Sample Means and Standard Deviations of Estimates of Reduced Form Parameters 
Obtained by Three Methods 


1. Reduced form coefficients derived from least-squares estimates of structural 
parameters. 



Population 

Sample 

Sample standard 


parameter 

mean 

deviation 

Pll 

.05 

.011 

.017 

Pu 

.27 

.094 

.083 

Pis 

.06 

.116 

.040 

Pu 

.11 

.127 

.036 

Pn 

-.16 

-.295 

.280 

P22 

-.90 

-2.054 

.360 

P 23 

.30 

.670 

.224 

Pzi 

.56 

.770 

.323 


2 . Reduced form coefficients derived from limited-information estimates of 
structural parameters. 


Pn 

.05 

.045 

.046 

Pu 

.27 

.254 

.042 

Pis 

.06 

.037 

.026 

pu 

.11 

.123 

.046 

pu 

-.16 

-.127 

.118 

P22 

-.90 

-.957 

.132 

P22 

.30 

.175 

.089 

P2A 

.56 

.559 

.129 

Reduced form coefficients estimated directly from least-squares regressions 
of y lt and y 2t on z lt z u , z 3t , and z u . 

pn 

.05 

.028 

.094 

P12 

.27 

.258 

.049 

Pis 

.06 

.044 

.071 

Pu 

.11 

.121 

.048 

P21 

-.16 

-.027 

.314 

P22 

-.90 

-.951 

.138 

P22 

.30 

.091 

.242 

Psi 

.56 

.576 

.151 


In comparing the first two methods, least-squares and limited 
information structural estimates, the bias of the former is clearly 
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shown. 16 ) In all cases, the mean of the limited-information 
estimate is closer than the mean of the least-squares estimate to 
the population value. In six of eight cases, the sample standard 
deviation of parameter estimates is lower for limited-information 
than for least-squares estimates. This is a reflection of the 
proposition that, in general, the optimal properties of least- 
squares estimates are not preserved under transformation. 17 ) 

In comparing the last two methods, it can be seen that 
neither shows much bias, which is as expected, but that limited- 
information estimates have uniformly smaller variance. This 
is a small sample exposition of the results derived in the preceding 
section. 

Ladd's results are extremely suggestive for our purposes though 
not conclusive. Of course, results from constructed experiments 
do not constitute general proof of investigated propositions, 
but Ladd's experiment is far less than perfectly designed for 
the particular objectives of this paper. It would be desirable to 
carry through similar experiments without the complication of 
observation error. Even within the framework of Ladd's ex- 
perimental design, there are changes that could be made to 
strengthen the present inquiry. Both least-square structural 
regressions were made with y lt as the dependent variable, but 
other choices deserve to be investigated in the present context. 
This is especially true since the population correlation between 
u 2t and y 2t is as high as 0.56. It is the correlation between 
disturbances and independent variables in regression analysis 
that introduces bias. The results in the table are also restrictive 
in that they deal only with variances and neglect covariance. 
A single statistic constructed along the lines of the standard 
error of forecast would be more adequate. In a future sampling 
inquiry some of these deficiencies can be remedied. 


16 ) Limited information estimates are consistent but not generally unbiased. 
In small samples of the type under consideration, bias may be revealed in them 
as well. Here the argument is simply that limited information estimates reveal, 
in the sampling experiment, a smaller degree of bias. 

17 ) The standard deviations are computed about sample means. If tlfey were 
computed about population means, the least-squares results would compare 
even less favorably. 
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